




BIOMETRIKA


A JOURNAL FOR THE STATISTICAL STUDY OF 
BIOLOGICAL PROBLEMS 

FOUNDED BY 

W. F. R. WELDON, FRANCIS GALTON and KARL PEARSON 

EDITED BY 

KARL PEARSON 

ASSISTED BY 

EGON SHARPE PEARSON 

VOLUME XXV 

1933 


ISSUED BY THE BIOMETRIKA OFFICE 
UNIVERSITY COLLEGE, LONDON 
AND PRINTED AT THE 
UNIVERSITY PRESS, CAMBRIDGE 



PRINTED IN GREAT BRITAIN 



CONTENTS OF VOLUME XXV 


Memoirs 


PAGE

I. A Study of the Naga Skull. By Elisabeth Kitson and G. M. 

Morant 1 

II. The Albanians of the North and South. (1) Introductory Account 

of Measurements and Photographs taken in 1929. By Miriam 
L. Tildesley. (2) Discussion of Miss Tildesley’s Measurements.

By the Staff of the Biometric Laboratory .... 21 

III. A Comparison of the Semi-invariants of the Distributions of Moment

and Semi-invariant Estimates in Samples from an Infinite
Population. By John Wishart 52

IV. An Empirical Age Scale. By Drysdale Anderson ... 61 

V. The Probability Integral of the Correlation Coefficient in Samples 

from a Normal Bivariate Population. By F. Garwood . . 71 

VI. A Further Note on the Relation between the Median and the 

Quartiles in Small Samples from a Normal Population. By 
Tokishige Hojo ... 79

VII. A Further Study of Methods of Constructing Life Tables when 

Certain Causes of Death are Eliminated. By M. Noel Karn . 91 

VIII. A Test of the Significance of the Difference of the Correlation 


Coefficients in Normal Bivariate Samples. By Fred A. Brander 102 

IX. Plural Births with a New Pedigree. By Julia Bell . . .110 

X. On Correlation Functions of Type III. By S. D. Wicksell . . 121

XI. On the Parent Population with Independent Variates which gives 

the Minimum Value of φ² for a Given Sample. By Karl
Pearson 134

XII. The Skulls from Excavations at Dunstable, Bedfordshire. By Doris 

Dingwall and Matthew Young 147 

XIII. On the Application of the Double Bessel Function K_{τ₁,τ₂}(x) to

Statistical Problems. By Karl Pearson . . . 158 



XIV. The Cranial Coordinatograph, the Standard Planes of the Skull,
and the Value of Cartesian Geometry to the Craniologist, with
some Illustrations of the Uses of the New Method. By Karl
Pearson 217

XV. A Study of Twelfth and Thirteenth Dynasty Skulls from Kerma
(Nubia). By Margot Collett 254

XVI. On the Likelihood that one Unknown Probability exceeds another
in view of the Evidence of two Samples. By William R.
Thompson 286

XVII. On Asymptotic Formulae for the Hypergeometric Series. I. Hypergeometric
Series in which the fourth Element, x, is Unity. By
O. L. Davies 295

XVIII. The Body Build of American-born Japanese Children. By P. M.
Suski 323

XIX. Methods of Statistical Analysis appropriate for k samples of two
Variables. By E. S. Pearson and S. S. Wilks 353

XX. On a Method of determining whether a sample of size n supposed
to have been drawn from a Parent Population having a known
Probability Integral has probably been drawn at random. By
Karl Pearson 379


Miscellanea

(i) Adjustments for the Moments of J-shaped Curves. By
W. Palin Elderton 179

(ii) Note on Mr Palin Elderton’s Corrections to the Moments of
J-Curves. By Karl Pearson 180

(iii) A General Expression for the Moments of Certain Symmetrical
Functions of Normal Samples. By R. C. Geary 184

(iv) A Statistical Study of the Daucus Carota L. By William
Dowell Baten 186

(v) On a Property of the Mean Ranges in Samples from a
Normal Population and on some Integrals of Prof. T. Hojo.
By Prof. V. Romanovsky 195

(vi) Note on the Shrinkage of Physical Characters in Man and
Woman with Age, as an illustration of the use of χ², P
Methods. By Pamela C. V. Lesser 197




(vii) On the Distribution of Student's Ratio for Samples of Three
Drawn from a Rectangular Distribution. By Victor
Perlo 203

(viii) The Distribution of √β₁ in Samples of Four from a Normal
Universe. By A. T. McKay 204

(ix) Note on Mr McKay’s Paper. Editorial 210

(x) Note on the Fitting of Frequency Curves. Editorial 213

(xi) The Distribution of β₂ in Samples of Four from a Normal
Universe. By A. T. McKay 411

(xii) A Note on the Distribution of Range in Samples of n. By
A. T. McKay and E. S. Pearson 415

(xiii) On a Recurrence Relation connected with the Double Bessel
Functions K_{τ₁,τ₂}(x) and T_{τ₁,τ₂}(x). By Constance M.
Rigby 420


List of Plates, etc.

Biometrika Portrait Series, No. X. Sir William Petty, 

Knight, painted by J. Closterman and engraved by I. Smith Frontispiece 

(α) E. Kitson and G. M. Morant: A Study of the Naga Skull.

Plate I. Normal Males, Normal Females and Juveniles,
Naga Crania ....... to face page 1

Plate II. Typical Male Naga Skull. Norma facialis. (R.C.S.
6.6231) ......... „ „ 6

Plate III. Typical Male Naga Skull. Norma lateralis. (R.C.S.
6.6231) ....... „ „ „

Plate IV. A. Typical Male Naga Skull. Norma verticalis.
(R.C.S. 6.6231).

B. Male Naga Skull (R.C.S. 6.6232) with Wormian

bones in place of nasal bones. 

C. Male Naga Skull (1927 series, B.L. No. 44) 

showing the unerupted third left molar hori- 
zontal and preventing the second molar from 
erupting „ „ „ 

One folding table of Individual Measurements of Naga Skulls „ „ 20 

Two cranial contours on tissues in pocket at end of volume. 




(β) M. L. Tildesley: The Albanians of the North and South.

Plate I. Type Silhouette of the Northern Albanian Group to face page 42 

Plate II. Type Silhouette of the Southern Albanian Group „ „ „

Plate III. Albanians of the North (1st Series), Moslems . „ „ 50 

Plate IV. Albanians of the North (2nd Series), Moslems 

and Catholic „ „ „ 

Plate V. Albanians of the South, Moslems and Orthodox 

Greek . „ „ 

Two type contours on tissues in pocket at end of volume. 

(γ) Julia Bell: Plural Births with a New Pedigree.

Plate of Pedigrees of Plural Births „ „ 117 

(δ) D. Dingwall and M. Young: The Skulls from Excavations at

Dunstable, Bedfordshire. 

Folding Plate of Male Mandibular Measurements of Dunstable 
Skulls „ „ 154

Plate I. Typical Male Skull, No. 18. Norma facialis . „ „ „ 

Plate II. Typical Male Skull, No. 18. Norma lateralis „ „ „ 

Plate III. Typical Male Skull, No. 18. Norma verticalis . „ „ „ 

Plate IV. Typical Male Skull, No. 18. Norma occipitalis . „ „ „ 

Plate V. Typical Male Skull, No. 18. Norma basalis . „ „ „ 

Plate VI. Male Mandible, Skull No. 18. (a) Norma verticalis „ „ „ 

(b) Norma lateralis „ „ „

(ε) William Dowell Baten: A Statistical Study of the Daucus

Carota L.

Plate I. Flower and Bracts of Daucus Carota . . „ „ 180 

(ζ) K. Pearson: The Cranial Coordinatograph and the Standard

Planes of the Skull, etc. 

Plate I. Normae facialis et occipitalis of a Hindu Skull, 
showing how widely a plane bisecting the 
Auricular Axis and perpendicular to it diverges 
from the “mid-sagittal points” 252

Plate II. Norma lateralis (R. and L.) of a Hindu Skull, 
showing the customary transverse Vertical Plane 
and Frankfurt Plane 





Plate III. Normae verticalis et basalis of a Hindu Skull, 
showing deviation of Plane through mid-porion 
perpendicular to Auricular Axis from the “mid- 
sagittal points” on these aspects of the Skull . to face page 252 


Plate IV. Plan and Elevation Models of the chief “mid-
sagittal points” for a Fuegian and an Egyptian
(Nubian) Skull


Plate V. Plan and Elevation Models of the chief “mid- 
sagittal points” in the cases of an Arab and a 
Teita Negro Skull ...... 

Plate VI. Plan and Elevation Models of the “mid-sagittal
points” of an English and a Hindu Skull 

Plate VII. (a) First Form of Cranial Coordinatograph with 
Skull on Skull Staddle and independent 
Projectors. (b) Final Form of Cranial
Coordinatograph, combining apparatus for 
rendering vertical any Cranial Line with 
a Pearson Projector . 

In both figures the Coordinatograph is ar- 
ranged so that the Auricular Axis of the 
Skull is perpendicular to the Plan. 


Plate VIII. Symmetricised portraits of Oliver Cromwell, and
of his Death Mask


Plate IX. Full-Face Natural Portrait of a Lady with sym- 
metricised portraits of Left and Right sides of 
Face

Plate X. Natural Norma facialis of an Egyptian Skull with 

symmetricised Normae faciales of Right and 
Left sides 




(η) Margot Collett: A Study of Twelfth and Thirteenth Dynasty
Skulls from Kerma (Nubia). 

Plate I. Typical Male Kerma Skull. Norma facialis . „ „ 284 

Plate II. Typical Male Kerma Skull. Norma lateralis 

(Left Profile) „ „ 

Plate III. (a) Typical Male Kerma Skull. Norma verticalis. 

(b) Male Skull, showing Holes supposed to have 

been made by Insects after Death . . . „ „ „ 




Plate IV. (a) Male Skull with wounds on the Parietal and
Malar Bones. (b) Female Skull with large
Wormian bones between the Temporal Squama
and Parietal Bone . . . . to face page 284

Plate V. (a) Female Skull, showing diseased areas on either 
side of the Lambda. (b) Male Skull, showing
constriction of the basi-occipital, where it
unites with the Sphenoid, (c) Male Skull with 
a wound below the Right Orbit, (d) Female 
Skull, showing failure of the Tympanic Ele- 
ments to unite „ „ 

Plate VI. (a) Male Skull with the sockets of two teeth other 
than molars lacking. (b) Female Skull with
no sockets for lateral incisors, (c) Female 
Skull with no sockets for the left canine and 
right third molar, (d) Male Skull with sockets 
for three incisors only „ „ 

With four folding tables of Individual Measurements „ „ 

Six contours in pocket at end of volume. 




Biometrika, Vol. XXV, Parts I and II 

Elisabeth Kitson, A Study of the Naga Skull


Plate I 



Normal Males (top row), Normal Females (middle row) and Juveniles (bottom row) Naga Crania. 



Volume XXV 


MAY, 1933 


PARTS I AND II


BIOMETRIKA 

A STUDY OF THE NAGA SKULL. 

By ELISABETH KITSON, B.A., M.Sc. 

With the assistance of G. M. MORANT, D.Sc. 

In 1926 an expedition organised by the Government of Burma was sent into 
the Naga Hills for the purpose of suppressing the practice of human sacrifice there. 
A collection of skulls and other bones of the victims of sacrificial rites was made 
and these remains were despatched to the Indian Museum, Calcutta, in 1928. They 
were studied there by B. S. Guha and P. C. Basu and their report has recently 
been published*. It is said (p. 4) that: “Including small fragments the total 
number of bones was 217, of which 21 were whole or portions of arm and leg bones, 
31 small pieces and 117 and 43 frontal and occipital parts respectively of skulls. 
Only five of the skulls had the cranial vaults complete, though even in these the 
infra-occipital region and the greater part of the basis cranii have been removed.”
Of these cranial specimens 65 were loaned to the Biometric Laboratory by the
courtesy of the India Office and the Government of Burma, and they form the
subject of the present paper†. On account of the very incomplete nature of nearly
all the specimens, the numbers for which the majority of the usual measurements
can be found are considerably less than the complement of 65. Owing to the
kindness of Sir Arthur Keith, I was able to supplement my original measurements
of Naga skulls by those of seven specimens in the Museum of the Royal College
of Surgeons. In calculating means it was also possible to use some of the
individual measurements given by Guha and Basu for the crania collected in 1927
which were not sent to the Biometric Laboratory and some of their measurements
of three Naga skulls in the Indian Museum, Calcutta (Table XV of the Report).
Measurements provided in the following sources were also included: 

(a) Sir William Turner: “Contributions to the Craniology of the People of
the Empire of India. Part I. The Hill Tribes of the North-East Frontier and the
People of Burma.” Transactions of the Royal Society of Edinburgh, Vol. xxxix,
Part III, No. 28 (1899), pp. 703–747. Individual measurements are given on
p. 720 of one female and seven male Naga skulls.

* “A Report on the Human Relics recovered by the Naga Hills (Burma) Expedition for the Abolition
of Human Sacrifice during 1926–27.” Anthropological Bulletins from the Zoological Survey of India,
No. I (July, 1931).

† A male incomplete calvaria, consisting of complete frontal, right and left parietal and left temporal
bones together with the greater part of the occipital, was sent to the Biometric Laboratory with the
Naga remains. This bore no number or inscription on arrival and it was subsequently numbered 18. It
does not appear in the report cited and it is of a different type, and presumably of a different race, from
the crania described there.


(b) Prof. (afterwards Sir) George D. Thane: “On some Naga Skulls.” The
Journal of the Anthropological Institute, Vol. xi (1882), pp. 215–219. Individual
measurements are given of four Naga skulls in the Museum of the Royal College of
Surgeons which I have re-measured and of one other specimen not in that museum.

The Naga Hills are in the province of Eastern Bengal and Assam close to the 
north-west frontier of Burma. All the crania collected in 1927 were taken from 
villages in Burma lying in an area, known as the Triangle, within which human 
sacrifice was practised before that date. This is shown on the map in the Report 
and its position is indicated on the map (Fig. 1) on p. 3. Little reliable information 
regarding the origin of the victims could be obtained. It was ascertained that they 
did not belong to the tribes living within the sacrificial area, but that they probably 
came from adjoining districts to the west and south-west. According to one chief 
the victims belonged to the Singpa, Wakka, Himhku, Nukpa, Yaugngaw and 
Kyetsan tribes. It is safe to assume that the majority of the remains represent 
head-hunting Naga tribesmen, but those of a few captives or stray foreigners from 
other parts may be included. It is said in the Report that one or more holes had 
been drilled through each of the fragmentary skulls so that a string might be 
passed through in order to suspend the relic (see Plate I). At least two of the 
specimens (numbered in the Biometric Laboratory 44, the anterior half of a cranium, 
and 66, a supra-occipital) have no such holes, however. The three skulls in the 
Indian Museum, for which measurements are provided by Guha and Basu, are 
supposed to be those of Angami Nagas and these were collected near Kohima 
which is one hundred and thirty miles south-west of the Triangle. 

I am indebted to Miss M. L. Tildesley, Curator of the Department of Human
Osteology, for the following particulars of the Naga skulls in the Museum of the
Royal College of Surgeons. No. 6.621 (Flower’s 1907 Catalogue No. 652¹) was taken
from Ninu (95° 18' E., 26° 47' N.), a Konyak Naga village thirty miles to the west
of the Triangle. It was decorated with twelve rings of wire attached to the zygomatic
arches, orbits and nasal cavity. It is complete with the lower jaw and it has not
been pierced for suspension. This is probably the skull of a member of the Konyak
tribe and not that of a sacrificial victim*. Nos. 6.6221 and 6.6222 were taken from
the Konyak Naga village of Chongvi (94° 49' E., 26° 32' N.) fifty miles to the west
of the Triangle. They were decorated with horns and tassels and the former still
has these ornaments attached. The posterior parts of the calvariae are missing and
No. 6.6222 has a hole pierced through the frontal bone and its mandible is fastened
on with strips of bamboo. The two individuals represented had certainly been
sacrificial victims and it is believed that they were probably Konyak Nagas. No.
6.6231 (Red No. 793 = No. 773 in Barnard Davis’s Thesaurus Craniorum†) is the
complete skull of a freebooter who was shot on a plundering expedition about the
middle of the nineteenth century. The tribe to which the individual belonged is
* Dr J. H. Hutton has given his opinion regarding the origin of the Naga skulls in the Royal College
of Surgeons and several of the remarks quoted here are on his authority.

† It is said in this catalogue that the “occipital, atlas and dentata” are all ossified together. However,
the atlas of the specimen, as Professor Thane has previously noted, was never fused to the occiput.



FIG. 1. Burma & Adjoining Countries Showing the Naga Territory.


not known, but he is most likely to have been an Angami Naga. No. 6.6232 (Red
No. 794 = No. 774 in the Thesaurus Craniorum) is the complete cranium (without
the mandible) of a youth who had been a servant to Colonel Hanney. He is said to
have been about eighteen years of age at the time of his death and the tribe to which
he belonged is not known. No. 6.6233 (Red No. 796 = No. 1760 in the Supplement
to Thesaurus Craniorum) is the complete cranium without the mandible of a Naga
named Lentee. “He was murdered, it was supposed, by his woman.” The tribe to
which this individual belonged is not known but Lentee is said to be a common
name among the Ao Nagas, while it is seldom, if ever, met with among other tribes.
The Ao Nagas live to the south-west of the Konyak Nagas. Nos. 6.6234 and 6.6235
are two almost complete crania, without mandibles, presented by Dr J. H. Hutton.
They were taken from the cemetery of the Ao Naga village of Mongsemyimti (ca.
94° 45' E., 26° 27' N.) in 1928. This is approximately 60 miles west of the southern
corner of the Triangle.

The skull E in Professor Thane’s paper is the only one he describes which is
not in the Royal College of Surgeons, and it is said to have been obtained from the
same neighbourhood as No. 6.621 in the museum there. Hence it is probably that of a
Konyak Naga. The eight skulls dealt with by Sir William Turner were taken from
the house of a Tonkal Naga in the upper village of Hwining which is some forty miles
north-east of Manipur. This is approximately one hundred and fifty miles south-west
of the Triangle. The custom there is to bury the dead, so these specimens were
evidently trophies although they are almost complete crania. They are believed to
be those of Tonkal Nagas from other villages. It is probable that the vast majority
of these crania, for which measurements are now available, are those of Nagas of
various tribes who lived in an area some one hundred and fifty miles long and
eighty broad extending north-east from a point twenty-five miles north of Manipur
and lying parallel to, and possibly across, the frontier between Burma and Assam
(see map, Fig. 1). A few individuals from outside this area, and possibly some who 
were not Nagas, may have been included. The sample is too small, and the particulars 
regarding the origin of the specimens are too indefinite, to make any comparisons 
between different groups of Nagas possible. As far as I could judge from the skulls 
I was able to handle, the group is a racially homogeneous one and the measurements 
of the total sample appear to suggest the same conclusion. All the material was 
pooled for the purpose of providing mean measurements. 

The selected group of the skulls collected in 1927 which was sent to the Biometric 
Laboratory bore no numbers or inscriptions by which they could be identified on 
arrival. They were numbered serially (1–66) there, and nothing further was done
with No. 18 for the reasons stated above (p. 1, footnote). From the photographs 
provided in Guha and Basu’s Report I was able to identify twelve of the specimens 
with certainty*, and what may be called the corresponding London and Calcutta 

* Viz. the skulls numbered 2, 3, 8, 14, 29, 34, 35, 36, 37, 47, 51 and 56 in the Biometric Laboratory.
Our No. 51 corresponds to No. N. 4 of Guha and Basu and it does not appear in our table of individual
measurements as it is an unsexed fragment. They also give photographs of skulls which they numbered
N. 95, N. 170 and N. 182, and these were not sent to the Biometric Laboratory.





numbers for these will be found in Table VII of individual measurements given at
the end of this memoir. By comparing measurements it was also possible to identify
twenty additional specimens with a sufficient degree of probability. The majority of
the remaining thirty-three crania sent to the Biometric Laboratory, for which Guha
and Basu’s numbers cannot be found, are those of juvenile individuals and measure-
ments of these are not given in the Report. The technique of measurement followed
there is that of the Monaco Congress*, with a few modifications and additions; and the
majority of the definitions used may be supposed identically the same, for practical
purposes, as those of the biometric scheme which I followed. We are thus given an
opportunity of comparing the measurements taken by different observers on the same
thirty† skulls, though nearly all these are, unfortunately, very incomplete. The
distributions of the differences between Guha and Basu’s measurements and ours are
given in Table I for fourteen characters and it can be seen at once that there is a
deplorably bad correspondence in nearly every case. It is known from laboratory
practice that if measurements on the same skulls are repeated by a single observer,
or if they are taken by two different observers, then the maximum differences found
should never exceed 2 mms. in the case of the characters considered. This will only
be so, of course, if the workers have been adequately trained and if they interpret the
definitions of the measurements in precisely the same ways. It is evident that these
conditions do not hold in the present instance since only three of the distributions
of differences lie within the limits −2.05 and +2.05. My queried measurements
were omitted in compiling the table and there are none such given by Guha and
Basu. Where my reading differed from theirs by more than 1.5 mms. it was taken
independently by Dr G. M. Morant and these comparisons confirmed the fact that
my measurements had been taken in accordance with the customary biometric
technique. Some of the larger differences in Table I, particularly in the case of
LB and S₁, are almost certainly due to errors of 5 or 10 mms. in reading a scale,
but other large ones can only be attributed to the fact that Guha and Basu had
radically different conceptions of the ways in which the measurements were to be
taken from ours. The discordance is most marked in the case of the palatal length
and we cannot imagine what definition can have been applied which would give
consistently smaller readings than the Monaco longueur de la voûte palatine which
is our G₁ and Martin’s chord from staphylion to orale. It would clearly be unsafe
to accept all Guha and Basu’s measurements of the Naga crania collected in 1927
which were not sent to the Biometric Laboratory. For the purposes of computing
means I used the values of B′, LB, J, S₁, G′H, NB, O₁L, L, B, S₂, S₃, S, fml
and fmb given for these skulls and their values for the first seven of these measure-
* G. Papillault: “Entente Internationale pour l’Unification des Mesures craniométriques et céphalo-
métriques.” Congrès International d’Anthropologie et d’Archéologie Préhistorique, Compte Rendu de la
treizième Session, Monaco 1906. Tome II (1908), pp. 377–394.

† There are thirty-two skulls sent to the Biometric Laboratory for which the numbers in Guha and
Basu’s Report can be found. One of these (London No. 51) is an occipital fragment on which no sufficiently
accurate measurements can be taken although S₃ and S₃′ are given in the Report: another (No. 8) is a
juvenile specimen for which no measurements are given there. Another juvenile (No. 14 = Calcutta N. 176)
is included in the tables of female adult measurements in the Report.



TABLE I.

Differences (in mms.) between Measurements of the same Naga Skulls taken by
Kitson (K.) and Guha and Basu (G.B.).

Differences in mms.
(K. − G.B.)           B′   H′   B″   LB    J   S₁  S₁′  G′H   NB  O₁L  O₂L  G₁′   G₁  NH′

−10.05 to −9.05        -    -    -    -    -    1    -    -    -    -    -    -    -    -
−9.05 to −8.05         -    -    -    -    -    -    -    -    -    -    -    -    -    -
−8.05 to −7.05         -    -    -    -    -    -    -    -    -    -    -    -    -    -
−7.05 to −6.05         -    -    -    -    -    -    -    -    -    -    -    -    -    -
−6.05 to −5.05         -    1    -    -    -    -    -    -    -    -    -    -    -    -
−5.05 to −4.05         -    2    -    1    -    -    -    -    -    -    -    -    -    -
−4.05 to −3.05         -    1    -    -    -    -    -    1    -    -    -    -    -    -
−3.05 to −2.05         3    -    -    -    2    -    -    1    -    -    2    -    -    7
−2.05 to −1.05         6    1    1    -    -    2    1    -    2    -    3    -    -    4
−1.05 to −0.05        16    1    3    -    1    4    7    4   12    3   14    -    -    7
0                      2    -    2    3    6    2    1    2    3    1    -    -    -    1
0.05 to 1.05           -    -    3    2   10    6    4    8    9    3    3    2    1    2
1.05 to 2.05           -    -    2    1    1    2    1    2    -    -    1    2    7    1
2.05 to 3.05           -    -    2    -    -    1    -    2    -    -    -    5    4    -
3.05 to 4.05           -    -    3    -    -    -    -    -    -    -    -    4    6    -
4.05 to 5.05           -    -    1    -    -    -    -    -    -    -    -    1    2    -
5.05 to 6.05           -    -    -    -    -    -    -    -    -    -    -    3    1    -
6.05 to 7.05           -    -    1    -    -    1    -    -    -    -    -    2    -    -
7.05 to 10.05          -    -    1    -    -    -    -    -    -    -    -    -    -    -
10.05 to 11.05         -    -    -    -    -    -    -    -    -    -    -    1    -    -

Totals                27    6   19    7   20   19   14   20   26    7   23   20   21   22


ments were also used in computing standard deviations and coefficients of variation*.
The comparisons between Guha and Basu’s readings for B′, LB and G′H are not
altogether satisfactory, but it may be assumed that the inclusion of their values of
these measurements for the additional skulls will not affect the constants appreciably.
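
The check applied to Table I above (does a distribution of differences lie wholly within the limits −2.05 and +2.05?) can be sketched in a few lines. This is only an illustration: the counts are those read from Table I for four of the characters, while the function names and the choice of holding class limits in hundredths of a millimetre are assumptions made for the example, not part of the original memoir.

```python
# Frequencies of the differences K. - G.B. for four characters, as read from
# Table I. Each entry is (lower limit, upper limit, count), with the limits
# held in hundredths of a millimetre so that comparisons stay exact
# (an assumption of this sketch, chosen to avoid floating-point ties).
TABLE_I = {
    "B'":  [(-305, -205, 3), (-205, -105, 6), (-105, -5, 16), (-5, 5, 2)],
    "LB":  [(-505, -405, 1), (-5, 5, 3), (5, 105, 2), (105, 205, 1)],
    "G'H": [(-405, -305, 1), (-305, -205, 1), (-105, -5, 4), (-5, 5, 2),
            (5, 105, 8), (105, 205, 2), (205, 305, 2)],
    "NB":  [(-205, -105, 2), (-105, -5, 12), (-5, 5, 3), (5, 105, 9)],
}

LIMIT = 205  # the 2.05 mm limit quoted in the text, in hundredths of a mm


def total(classes):
    """Number of skulls compared for one character."""
    return sum(count for _, _, count in classes)


def within_limits(classes, limit=LIMIT):
    """True if every occupied class lies between -limit and +limit."""
    return all(-limit <= lower and upper <= limit
               for lower, upper, count in classes if count > 0)


def count_outside(classes, limit=LIMIT):
    """Number of differences falling in classes beyond the limits."""
    return sum(count for lower, upper, count in classes
               if lower < -limit or upper > limit)


if __name__ == "__main__":
    for name, classes in TABLE_I.items():
        print(name, total(classes), within_limits(classes), count_outside(classes))
```

On these four columns only NB passes the check, which agrees with the remark that the readings for B′, LB and G′H are not altogether satisfactory.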

The sexing of the Naga skulls collected in 1927 is particularly difficult owing 
to the fact that most of them are very defective. The sexual characters appear to 
be as well marked as for most races, however. Among the sample sent to the 
Biometric Laboratory we distinguished nineteen male adults : sixteen of these were 
supposed male by Guha and Basu and they give no sex for the other three which 
are occipital fragments. Of our fourteen female adults five were supposed male 
and five female by Guha and Basu, while the remaining four cannot be identified 
with their numbers. They distinguished sixty-one males and twenty-four females
in their total sample though there was probably no such clear preponderance of 

* These are also the only measurements taken by Guha and Basu on the three Naga skulls in the
Indian Museum which I have used.




Plate II

Typical Male Naga Skull. Norma facialis. (R.C.S. 6.6231.)


Plate III

Typical Male Naga Skull. Norma lateralis. (R.C.S. 6.6231.)


Plate IV

A. Typical Male Naga Skull. Norma verticalis. (R.C.S. 6.6231.)

B. Male Naga Skull (R.C.S. 6.6232) with
wormian bones in place of nasal bones.

C. Male Naga Skull (1927 series, B.L. No. 44) showing
the unerupted third left molar horizontal and
preventing the second molar from erupting.

Typical and Anomalous Naga Skulls.



one sex over the other if our sexing is more correct. The remainder of the sample 
of the 1927 skulls sent to the Biometric Laboratory is made up by twenty-one 
specimens representing immature individuals and eleven fragments which probably 
represent adults and for which no sexes can be given. The immature character of 
a few of the skulls placed in the juvenile group is somewhat uncertain owing to 
their incomplete nature, but one (London No. 14 = Calcutta No. N. 176) with a
complete palate which was undoubtedly juvenile is included in Guha and Basu’s
tables of female adult measurements. Our individual measurements of all these
skulls which were sent to London, with the exception of the adult unsexed fragments,
are given in Table VII at the end of this paper together with those of all the Naga
skulls in the Royal College of Surgeons. One of the last is immature; the sexes of five
others were either known, or they were estimated by Barnard Davis, Flower and
Thane and their consistent determinations were accepted; the remaining two were
collected in 1928 and I have supposed that one of these (No. 6.6235) is male and
the other (No. 6.6234) female. I accepted the sexes given for the skulls for which
I have used previously published measurements and these include those in Guha 
and Basu’s Report which were not sent to London. 

Remarks on sutures, for adult skulls, and on teeth for all I was able to examine 
are given in Table VII. Unless otherwise stated, the coronal, sagittal and 
lambdoid sutures of the adult specimens are all open. There appears to be a clear
preponderance of young adults over fully mature individuals in the case of both
sexes and only one (No. 6.6235)*, which is a male, can be called ageing. Turner
found one aged Tonkal Naga skull among the eight sacrificial specimens he examined.
No. 6.6235 came from a cemetery so there is a suggestion that young adults and
children were preferred as victims by the Naga head-hunters. The metopic
specimens are three (Nos. 31, 6.6232 and 6.6233) out of twenty possible male
crania, three (Nos. 4, 40 and 47) out of sixteen possible female and two (Nos. 43
and 49) out of twenty possible juvenile specimens. Guha and Basu say for their
total sample that the metopic suture is present “in fifteen specimens in various
proportions.” The number which they could examine for this anomaly is not
stated, but it was probably about one hundred. It seems reasonable to conclude
that, judging from the Naga crania known, the metopic suture occurs in more than
ten per cent. of cases without regard to sex or age, no specimens so young that the
suture would not normally have closed being included. The frequency is unusually
high for a non-European race. It is possible that it is inaccurate owing to the fact
that the collectors of the crania favoured those which were metopic. One male
cranium with interparietal bones was found among five male, four female and two
juvenile specimens which it was possible to examine for this anomaly: this (No. 56)
has the ossa triangularia only separate†. Wormian bones were found in more

* The numbers beginning 6. are those of the skulls in the Royal College of Surgeons. Others from
1 to 66 given in the text are the Biometric Laboratory numbers of the skulls collected in 1927 which were
sent there.

† Guha and Basu give a photograph of an unsexed occipital fragment (No. N. 96) having the complete
form of interparietal bones (Plate IV, Fig. 7): this was not sent to the Biometric Laboratory.




than fifty per cent, of each sex and of the juveniles, while epipteric bones occurred 
with the like percentage. The region of the pterion could be examined on both 
sides in the case of seventeen male skulls and there is no example of fronto- 
temporal articulation, though one specimen (No. 31) shows a close approach to this 
condition on both sides: the same region could be examined for seventeen females 
on the right and sixteen on the left, while there is one case (No. 40) of fronto- 
temporal articulation on the left and the right pterion of the same skull is normal: 
for the juveniles the crania which it was possible to examine number fourteen on the 
right and seventeen on the left, while there is one case (No. 17) of fronto-temporal 
articulation on the left and the right pterion of the same skull is defective. Turner 
found two cases of fronto-temporal articulation on the left side only among the 
eight Naga skulls he examined. There are examples of single or multiple tympanic 
perforation in all the groups. A female specimen (No. 6.6234) shows traces of the
sutures between the ex- and supra-occipitals on both sides. Most of the teeth of the 
adults preserved are considerably worn but in a good state of preservation. There 
are eighteen male skulls on which the palates are almost complete and for fifteen of 
these no teeth had been lost before death. Molars were the only teeth lost in the 
case of the other three and no carious teeth other than molars were found for the 
same group. Owing to their failure to erupt, two of the eighteen specimens had no 
third molars on either side and one other had no third molar on the right. There 
are several examples of ‘shovel-shaped’ incisors (see Figs. 3 and 4, Plate V, in
Guha and Basu’s Report). One male (No. 44) has the unerupted third molar on 
the left side horizontal and its crown is in contact with the second molar which has 
thus been prevented from erupting (see Plate IV C below). A male skull (No. 6.6232)
has wormian bones in place of the nasal bones and the ethmoid is exposed below
them (see Plate IV B). The absence of the nasal bones appears to be a very rare
anomaly. Three examples of it were described by the present writer in a paper on
a series of Teita skulls from Kenya Colony*. A few healed wounds were found on
the Naga skulls. None of these is large except that on a strong male specimen 
(No. 37) which had a severe sword-cut on the right side of the frontal bone (see 
Plate VI, Fig. 1 in Guha and Basu’s Report and Plate I below). 

No geographical divisions of the Naga crania for which measurements are 
available can be made, but it is of interest to compare the means of three 
groups obtained in different ways. This comparison is made in Table II for the 
characters which can be measured on the largest numbers of specimens. The 
groups are: 

(a) the crania collected in 1927 measured by the present writer,

(b) the crania collected in 1927 measured by Guha and Basu but not by the
present writer, 

(c) the crania measured by Turner (seven male and one female), the female 
measured by Thane, two male and one female crania in the Indian Museum, 

* “A Study of the Negro Skull with special Reference to the Crania from Kenya Colony.” Biometrika,
Vol. XXIII (1931), pp. 271–314. See pp. 284–286 and Plate IV A, B and C.





Calcutta, for which measurements are provided by Guha and Basu, and the crania
in the Royal College of Surgeons measured by the present writer. 

These three groups are made up of small numbers and considerable differences 
between the means may be expected if only on account of random sampling. There 
is, nevertheless, a fairly good correspondence throughout while there is a remarkably 
close one between groups (a) and (c). The means for group (b) alone differ
appreciably from those for the other two, in having smaller male values of B′, S₁, G′H
and J. Only a small part, if any, of these differences can be attributed to the fact
that differences would be found between Guha and Basu’s measurements for these
characters and ours taken on the same crania. The male means computed from
their measurements for the crania which we measured are: B′ = 95.8 (16),
S₁ = 128.1 (12), G′H = 70.5 (13) and J = 133.7 (14). All these differ by less than
one mm. from the corresponding means of ours given in Table II. It is probable


TABLE II.

Mean Measurements for Groups of Naga Skulls.

Groups: (a) Kitson (1927 series);
        (b) Guha and Basu (remainder of 1927 series);
        (c) male: Turner, Guha and Basu (Indian Museum), Kitson (R.C.S.);
            female: Turner, Thane, Guha and Basu (Indian Museum), Kitson (R.C.S.).

        Male                                       Female
        (a)           (b)           (c)            (a)           (b)           (c)
B′       94.9 (16)     91.5 (38)     93.9 (13)      84.1 (13)     88.6 (18)     85.0 (6)
LB       99.9 (7)      99.9 (15)     99.1 (10)      90.3 (3)      93.3 (3)      93.3 (3)
S₁      128.6 (12)    124.7 (19)    127.3 (13)     122.1 (11)    121.8 (4)     121.1 (5)
G′H      70.4 (13)     63.6 (36)     69.1 (11)      64.6 (11)     62.7 (16)     66.6 (4)
J       134.1 (14)    128.9 (21)    133.8 (10)     127.9 (7)     124.1 (10)    123.3 (5)
NB       27.1 (16)     26.8 (36)     26.5 (12)      26.1 (11)     26.4 (16)     25.7 (5)
O₁L      39.1 (5)      38.4 (36)     38.4 (4)       38.1 (8)      37.4 (17)     40.1 (2)


that the appreciable differences found between the male means of the two samples 
of skulls collected in 1927 are due to sexing. It has been seen that Guha and Basu 
were inclined to assign a larger proportion to the male series than we were and 
hence their male means would tend to be smaller than ours. The sexes of three of 
the skulls in Group (c) were known and those of the others were assigned by Turner, 
Flower, Guha and Basu (three) and Kitson (two). The close agreement of the 
means for this group with ours for the 1927 series encourages the belief that our 
sexing of the 1927 series is substantially correct and, if this is so, Guha and Basu’s 
is probably in error. It can be shown that the sex ratios given by their means 
are particularly small. The samples are too small to make it possible to decide 
these questions at all definitely. The means for the three groups were pooled to 
give the best mean values which it is possible to obtain for Naga skulls at present 
and it is unlikely that these figures differ appreciably from the true means of the 






TABLE III.

Constants of Variation for Naga and Egyptian Skulls*.

        Standard deviations            Coefficients of variation
        Nagas                          Nagas                          Egyptians E
        Male          Female           Male          Female           Male           Female
B′      4.97 ± .29    4.27 ± .33       5.36 ± .31    4.80 ± .38       4.28 ± .07     4.11 ± .08
LB      4.01 ± .34    —                4.02 ± .34    —                3.90 ± .06
S₁      5.26 ± .38    —                              —                4.88 ± .08     4.66 ± .09
G′H     4.78 ± .29    4.18 ± .36       7.23 ± .45    6.64 ± .66       6.90 ± .10     5.64 ± .11
NH′     2.55 ± .23    —                4.90 ± .44    —                6.66 ± .09†    6.31 ± .10†
NB      1.64 ± .10    1.69 ± .13       6.12 ± .37    6.19 ± .62       7.27 ± .12     6.98 ± .14
J       5.83 ± .41    —                4.43 ± .32    —                               3.62 ± .08
O₁L     1.79 ± .13    1.96 ± .18       4.64 ± .33    5.19 ± .48       4.06 ± .07‡    3.97 ± .08‡
O₂L     2.13 ± .19    —                6.27 ± .57    —                6.66 ± .09     5.62 ± .11

* The numbers of skulls on which the Naga constants are based can be seen from Table IV: the
smallest is 27.

† These are for the Frankfurt nasal height NH, L in place of NH′.

‡ These are for orbital breadths found by Fawcett’s curvature method in place of dacryal breadths.

available sample owing to errors of sexing. The pooled means for all the characters 
considered are in Table IV. There is a very satisfactory agreement between the 
corresponding male and female Naga indices and angles, allowing for the fact that 
the samples are very small. It is generally found, as in the present case, that the 
female occipital (Oc. I.) and orbital indices are greater than the male while the
male simotic (100 SS/SC) and palatal height-breadth indices (100 EH/G₂) are
greater than the female. The measurements suggest that the series for the two
sexes represent the same racial type and a direct comparison of the skulls we were
able to examine had suggested the same conclusion. The following mean indices are
found for the immature crania: 100 G′H/GB = 67.6 (16), 100 NB/NH, R = 54.2 (18),
100 O₂/O₁, L = 85.5 (20) and 100 SS/SC = 24.2 (18). As far as can be seen the
juvenile specimens represent the same racial type as the adult. 
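
The pooling of the group means, and the relation between the standard deviations and the coefficients of variation, can be sketched as follows. This is a minimal illustration only: it assumes that "pooled" means a mean weighted by the number of skulls in each group (the text does not state the weighting), and the function and variable names are hypothetical. The means and numbers are the male values of Table II and the standard deviations are the male Naga values of Table III.

```python
# Sketch only: assumes the pooled mean weights each group by its number of skulls.


def pooled_mean(groups):
    """Weighted mean of (mean, n) pairs, each mean weighted by its n."""
    total_n = sum(n for _, n in groups)
    return sum(mean * n for mean, n in groups) / total_n


def coefficient_of_variation(sd, mean):
    """Coefficient of variation expressed as a percentage of the mean."""
    return 100.0 * sd / mean


# Male means and numbers of skulls for groups (a), (b) and (c) of Table II.
male_groups = {
    "B'": [(94.9, 16), (91.5, 38), (93.9, 13)],
    "G'H": [(70.4, 13), (63.6, 36), (69.1, 11)],
    "NB": [(27.1, 16), (26.8, 36), (26.5, 12)],
}

# Male Naga standard deviations as printed in Table III.
male_sd = {"B'": 4.97, "G'H": 4.78, "NB": 1.64}

pooled = {name: pooled_mean(groups) for name, groups in male_groups.items()}
cv = {name: coefficient_of_variation(male_sd[name], pooled[name]) for name in pooled}

for name in pooled:
    print(f"{name}: pooled mean {pooled[name]:.1f}, coefficient of variation {cv[name]:.2f}")
```

Under this assumption the three coefficients come out at 5.36, 7.23 and 6.12, which are the male Naga values printed in Table III.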

The standard deviations and coefficients of variation are given in Table III for 
all the characters which can be measured on 27 or more crania. No significant 
differences in variability are found between the two sexes. Comparison is made in 
the table with the coefficients of variation given for the long Egyptian E series of 
26th to 30th Dynasty skulls§. It is known that this series, which came from a single
cemetery, is less variable than nearly all that have been obtained from European 
sites. The constants for nine characters can be compared in the case of the males, 
the Naga value being greater for six and less for the other three. The difference 

§ Karl Pearson and Adelaide G. Davin: “On the Biometric Constants of the Human Skull.”
Biometrika, Vol. XVI (1924), pp. 328–363.














TABLE IV. Mean Measurements of the Naga and a Burmese Series.



* Definitions of the measurements denoted by the index-letters will be found in Biometrika, Vol. XXI
(1929), pp. 82–84.

† The following additional means can be given for the Burmese from Insein Prison: C = 1404.0 (20),
Glabellar U = 500.6 (28), Broca’s Q′ = 304.8 (29), Biasterionic B = 107.0 (29), Lacrymal O₁ = 39.8 (29),
100 O₂/Lacrymal O₁ = 85.9 (29).

‡ This is Broca’s nasal height from the nasion to the “base” of the anterior nasal spine.
















between the coefficients of variation exceeds 2.5 times its probable error in the
case of B′ (Δ/p.e. Δ = 3.4), G′H (2.9), J (2.7) and NB (3.0), while the Naga value
is greater than the Egyptian for the first three of these characters. Three of
the four female constants which can be compared are greater for the Nagas than
for the Egyptians, but the difference is only significant in the case of O₁L (2.5).
The Naga series is rather more variable than the Egyptian, but, as far as can be 
seen, it is not more heterogeneous than most which have to be accepted as repre- 
senting a single racial type. 
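
The ratios quoted above can be illustrated by a short calculation. This is a sketch under a stated assumption: the probable error of a difference between two independent constants is taken as the root of the sum of the squares of their probable errors, a formula the text does not itself give. The function name and the dictionary keys are hypothetical; the figures are coefficients of variation from Table III.

```python
import math


def difference_ratio(value_a, pe_a, value_b, pe_b):
    """Ratio of a difference to its probable error, for independent estimates."""
    # Assumed rule: probable errors of independent estimates combine in quadrature.
    pe_difference = math.sqrt(pe_a ** 2 + pe_b ** 2)
    return abs(value_a - value_b) / pe_difference


# (Naga value, its p.e., Egyptian E value, its p.e.), all from Table III.
comparisons = {
    "B' male": (5.36, 0.31, 4.28, 0.07),
    "NB male": (6.12, 0.37, 7.27, 0.12),
    "O1L female": (5.19, 0.48, 3.97, 0.08),
}

ratios = {name: difference_ratio(*values) for name, values in comparisons.items()}

for name, ratio in ratios.items():
    flag = "exceeds" if ratio >= 2.5 else "does not exceed"
    print(f"{name}: ratio {ratio:.1f} ({flag} 2.5 times the probable error)")
```

To one decimal place the three ratios are 3.4, 3.0 and 2.5, in agreement with the values quoted in the text for these characters.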

Comparisons between the male Naga and other Asiatic cranial series were made 
by the method of the coefficient of racial likeness. These coefficients have recently 
been given between all pairs of 26 Asiatic series* and we selected nine from these 
either on account of the fact that they represent neighbouring peoples, or because 
a rough comparison of the means suggested that they might be allied to the Nagas. 
The series chosen represent Tagals, a supposed non-negrito people from the 
Philippine Islands, Dayaks from Borneo (both measured by von Bonin), Tibetans of 
the A type from the south-west of Tibet and Nepalese (Morant), Chinese from the
province of Fukien (Harrower), late prehistoric Chinese from Kansu and Honan 
(Black), Burmese of the A type from Moulmein (Tildesley), Ainos (Koganei) and 
Hindus from Bengal, Orissa and Southern India (Danielli, Turner and Mantegazza). 
Comparison was also made with a series of 29 crania of Burmese men who died in 
Insein Prison, Lower Burma. Measurements of these were given by Sir William 
Turner† and the previously unpublished means are in Table IV. The coefficients
of racial likeness between the Naga and these ten series are given in Table V 
and the relationships suggested by these criteria are shown in Fig. 2 which is 
based on Fig. 3 in the paper by Woo and Morant cited. They concluded that 
the most reasonable classification was given if only the lowest orders of reduced 
coefficients were considered and for this purpose they ignored all greater than 19. 
There are four with the Nagas below this value. The lowest is that of 4.87 with the
Tagals and there are several Asiatic series for which the lowest reduced coefficient 
which has yet been found is greater than this. The connection with the Dayaks 
is nearly as close though it is less intimate than that between the Tagals and 
Dayaks. The Tibetans A and the Nepalese are further removed from the Nagas, 
but their resemblance to them is still far closer than that between the Naga and 
any of the Asiatic types with the exception of those just mentioned. The Nagas, 
Tagals, Dayaks and Tibetans A may be considered to form a closely interrelated 
group since their relationships to one another are more intimate than any between 
them and any other Asiatic types for which craniological data are available. As far 
as can be seen at present it is only by tracing relationships through this group that 

* T. L. Woo and G. M. Morant: “A Preliminary Classification of Asiatic Races based on Cranial
Measurements.” Biometrika, Vol. XXIV (1932), pp. 108–134. The means themselves or references to sources
in which they may be found are provided in the above paper: all have been published in Biometrika.

† “Contributions to the Craniology of the People of the Empire of India, Part I. The Hill Tribes of
the North-East Frontier and the People of Burma.” Transactions of the Royal Society of Edinburgh,
Vol. XXXIX, Part III, No. 28 (1899). The measurements of the crania of the Burmese from Insein Prison
are in Tables III and IV.




TABLE V.

Male Coefficients of Racial Likeness between the Naga and other Asiatic Series*.

                       Crude Coefficients                          Reduced Coefficients
Series                 All characters      Indices and angles      All characters    Indices and angles

Tagals                  1.19 ± .19 (24)    −0.56 ± .34 (8)          4.87 ± .77        −3.02 ± 1.84
                        [24.1, 26.0]        [15.4, 23.1]
Dayaks                  1.69 ± .19 (24)     0.25 ± .34 (8)          5.43 ± .61         1.09 ± 1.52
                        [24.1, 44.1]        [15.4, 42.7]
Tibetans A              3.24 ± .20 (23)     1.61 ± .34 (8)         11.05 ± .68         7.50 ± 1.68
                        [24.7, 36.0]        [16.4, 35.4]
Nepalese                4.08 ± .20 (23)     0.14 ± .34 (8)         12.74 ± .62         0.02 ± 1.49
                        [24.7, 46.7]        [15.4, 44.4]
Chinese: Fukien         6.20 ± .20 (22)     3.24 ± .36 (7)         20.20 ± .66        14.71 ± 1.64
                        [26.8, 36.0]        [15.9, 36.0]
Burmese (Turner)        6.78 ± .23 (17)    10.49 ± .43 (5)         23.80 ± .81        49.97 ± 2.03
                        [28.2, 28.8]        [16.6, 28.6]
Burmese A               7.16 ± .20 (23)     6.80 ± .34 (8)         23.99 ± .66        31.09 ± 1.54
                        [24.7, 40.2]        [16.4, 38.7]
Prehistoric Chinese     7.28 ± .21 (21)     3.48 ± .39 (6)         24.10 ± .69        16.27 ± 1.82
                        [25.6, 36.9]        [15.5, 34.7]
Ainos                  12.32 ± .24 (16)     4.03 ± .43 (5)         29.93 ± .58        16.16 ± 1.71
                        [27.9, 78.6]        [15.0, 73.8]
Hindus                 13.77 ± .23 (17)     1.51 ± .43 (5)         33.39 ± .56         5.74 ± 1.62
                        [28.2, 76.7]        [16.6, 63.2]

* The number in round brackets following the coefficient is the number of characters on which it is
based. The numbers in square brackets below the coefficient are the mean numbers of skulls available for
the characters used in computing it: the first is for the Naga and the second for the other series in the
comparison.

the Indian races can be linked up with the Northern Oriental races, the Chinese 
and Japanese, on the one hand, or with the Southern Oriental races, the Southern 
Burmese, Javanese and Aetas, on the other hand. The members of the aforementioned
closely interrelated group of intermediate types also stand between the
Northern and Southern Oriental groups. This state of affairs accords reasonably 
well with geographical considerations. The Nagas and Tibetans A are more or less 
centrally placed with regard to the peoples of India and the Orient. The Tagals and 
Dayaks inhabit islands to the south-east to-day, but there is some evidence, considered 
below, which suggests that they came from the same inland area. If this is so it is
less surprising to find that the Nagas, who are quite dissimilar to the Southern
Burmese, can be linked up with them by the Tagals and Dayaks. The Nagas have
a reduced coefficient of 23.80 with the Burmese series from Insein Prison and the
almost identical value of 23.99 with the Burmese A from Moulmein. These two
series from neighbouring towns have a crude coefficient of 3.57 ± .28 for 17 characters
and the reduced value is 10.64 ± .69.

[Fig. 2. Relationships between the Naga and other Asiatic series suggested by the coefficients of racial likeness. Recoverable labels: Indian Races (72.0–75.1); Javanese, Southern Burmese & Aetas (82.0–84.0).]
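
The relation between the crude and reduced coefficients of Table V can be sketched as follows. The text does not restate the formulae, so this illustration assumes the definitions usual in the biometric literature: the probable error of a crude coefficient based on M characters is taken as 0.67449 √(2/M), and a crude coefficient is reduced by multiplying it by 50 (n̄ + n̄′)/(n̄ n̄′), where n̄ and n̄′ are the mean numbers of skulls in the two series. The function names are hypothetical; the figures are the all-character entries of Table V for the Burmese (Turner) and Hindu series.

```python
import math


def crude_probable_error(n_characters):
    """Assumed probable error of a crude coefficient based on M characters."""
    return 0.67449 * math.sqrt(2.0 / n_characters)


def reduction_factor(mean_n_first, mean_n_second):
    """Assumed factor 50 * (n1 + n2) / (n1 * n2) applied to a crude coefficient."""
    return 50.0 * (mean_n_first + mean_n_second) / (mean_n_first * mean_n_second)


def reduce_coefficient(crude, n_characters, mean_n_first, mean_n_second):
    """Return the reduced coefficient and its probable error."""
    factor = reduction_factor(mean_n_first, mean_n_second)
    return crude * factor, crude_probable_error(n_characters) * factor


# (crude coefficient, number of characters, mean n for Nagas, mean n for other series)
rows = {
    "Burmese (Turner)": (6.78, 17, 28.2, 28.8),
    "Hindus": (13.77, 17, 28.2, 76.7),
}

reduced = {name: reduce_coefficient(*values) for name, values in rows.items()}

for name, (value, probable_error) in reduced.items():
    print(f"{name}: reduced coefficient {value:.2f} ± {probable_error:.2f}")
```

For these two rows the sketch gives about 23.79 ± .81 and 33.39 ± .56, against 23.80 ± .81 and 33.39 ± .56 in the table; small discrepancies of this kind are to be expected because the crude coefficients and the mean numbers of skulls are themselves rounded.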

Dr J. H. Hutton deals with the question of the origin of the Naga tribes in 
general in his book on the Angami tribe*. It is said that the weight of tradition 
points to migration from districts immediately to the south of the region occupied 
by the people to-day. “ Where the Nagas came from before they reached the country 
near Manipur is a much more difficult problem.... All sorts of origins have been 
ascribed to the race. They have been connected with the head-hunters of Malay 
and the races of the Southern Seas on the one hand, and traced back to China on the 
other.... On the basis of language their origin is assigned by Sir G. Grierson to the
second wave of emigration, that of the Tibeto-Burmans, from the traditional cradle of
the Indo-Chinese race in North-Western China....” In a footnote Dr Hutton accepts
the view that “the Nagas have very strong cultural affinities with the natives of
the Asiatic Islands, notably Borneo and the Philippine Islands, and perhaps physical
affinities with some of them.” The fact that there are strong affinities between some
aspects of the cultures of these peoples is one which has been stated by Mr Henry
Balfour†. Dr Charles Hose refers to the same cultural connection in a recent book‡.
He writes: “Dr Hutton, Colonel L. W. Shakespear, Mr J. P. Mills and others have
written very valuable and interesting books on the Naga tribes... and there seems to
me to be a very close similarity in the legends, superstitions, customs, habits and arts
of these tribes and [those of] the adjacent highlands of the remainder of the Brahma-
putra basin, which is characteristic of one or other of the ruder lank-haired tribes of
Borneo, Sumatra, the Philippines, and the other islands of the Malay Archipelago.”
The coefficients of racial likeness between the slender cranial series available suggest 
forcibly that there is a close physical, as well as cultural, relationship between the 
Nagas, the Dayaks and the “non-negrito” inhabitants of the Philippine Islands. If 
the arrangement shown in Fig. 2 illustrates the true connections of the Oriental 
and Indian races then it is probable that the Dayaks and Tagals came from an 
inland area which may well have been close to the region occupied by the Nagas 
to-day. It may be noted that the living people belonging to these races and the 
living Tibetans are generally said to possess Mongolian traits though they are not 
pure Mongolians. They are frequently said to have resulted from a blend of Caucasian 
and Mongoloid elements. 

The numbers of individuals making up the Naga cranial series are so small that 
it would not be profitable to make any detailed comparison between the means for 
single characters and those given for other Asiatic series. The mean cephalic index 

* The Angami Nagas (1921), p. 8.

† The Journal of the Royal Anthropological Institute, Vol. XLIV (1914), p. 57.

‡ Natural Man, A Record from Borneo (1926), p. 12.




for 14 male skulls is 76.9 and for 8 females it is 76.7. Measurements given by
Dr Hutton* for 237 Naga men belonging to various tribes lead to a mean cephalic
index of 78.4. Remembering that the living index is expected to be about two points
greater than the cranial, we have good reason to believe that our values are very
close to the true ones for Naga skulls in general. The coefficient of racial likeness
between the Nagas and Tagals could be based on 24 characters and only one of
these shows a significant difference, i.e. an α greater than 10: this is for
G′H (α = 16.03). The same thing is found in the comparison with the Dayaks
(α for G′H = 19.36) and the Naga upper facial height is shorter than the other two.
The coefficient with the Tibetans A could be based on 23 characters and only one
of these is found to indicate a significant difference, viz. H′ (α = 14.10), and the upper
facial heights are not differentiated in this case. Three or more characters are found
to differ significantly in the case of every one of the other comparisons. The Nagas
differ most markedly from the Nepalese and Hindus on account of their greater
bizygomatic, calvarial and nasal breadths, from the Chinese on account of their
lesser upper facial height and from the Southern Burmese on account of their lower
cephalic index, lesser upper facial height and calvarial and nasal breadths, but
greater calvarial length. The Aino cephalic index of 76.5 is very close to the Naga
value of 76.9 and the coefficient of racial likeness was calculated in this case although
no close connection was expected. Of the 16 characters compared six are found to
differ significantly: these are LB (α = 48.25), J (43.51), B′ (26.82), G′H (26.21),
NH, L (21.76) and NB (16.17). The Naga type has a smaller facial skeleton than the
Aino.
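
The quantity α used above as the test for a single character can be sketched as follows. The text gives only the values of α and the limit of 10, so the definition used here is an assumption: α is taken as n n′/(n + n′) multiplied by the square of the difference of the two means divided by a standard deviation σ (conventionally that of a long standard series), and the crude coefficient is taken as the mean α less one. All names and all numerical inputs in the example are hypothetical and are not measurements from the paper.

```python
def alpha(mean_a, n_a, mean_b, n_b, sigma):
    """Assumed contribution of one character: weighted squared standardised difference."""
    weight = n_a * n_b / (n_a + n_b)
    return weight * ((mean_a - mean_b) / sigma) ** 2


def crude_coefficient(alphas):
    """Assumed crude coefficient of racial likeness: mean alpha minus one."""
    return sum(alphas) / len(alphas) - 1.0


def is_significant(alpha_value, limit=10.0):
    """The text treats an alpha greater than 10 as a significant difference."""
    return alpha_value > limit


# Hypothetical inputs for illustration only: (mean, n) for two series and a sigma.
example_characters = {
    "character 1": ((66.0, 60), (70.0, 26), 4.0),
    "character 2": ((92.8, 67), (93.0, 26), 4.5),
    "character 3": ((26.8, 64), (26.5, 25), 1.8),
}

alphas = {
    name: alpha(first[0], first[1], second[0], second[1], sigma)
    for name, (first, second, sigma) in example_characters.items()
}

for name, value in alphas.items():
    print(f"{name}: alpha {value:.2f}, significant: {is_significant(value)}")
print(f"crude coefficient: {crude_coefficient(list(alphas.values())):.2f}")
```

On this definition a pair of series drawn from the same population would give values of α averaging about one, which is why the crude coefficient is expected to be near zero in that case.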

Of the adult Naga crania available in the Biometric Laboratory there were only 
six male and four female complete enough to be used in the ordinary way for the 
purpose of providing contours. No transverse or horizontal contours were drawn. 
The sagittal drawings were made in the usual way for all the adult specimens on
which both the nasion and bregma could be located. Where possible the cranium 
was orientated by making the nasion, bregma and lambda the same height above 
the drawing board, but if the lambda was missing the anterior nasal spine, alveolar 
point or basion was used in its place. A new method of constructing the type had 
to be devised in order that use might be made of the outlines of the incomplete 
specimens for which the Nγ base line is not available. Figs. 3 and 4 are the male
and female types and the mean measurements from which they were constructed
are given in Table VI. The base line is, as usual, Nγ given by six male and four
female crania only. For these the means of the ordinates VI to IX and the x’s and
y’s of the lambda, inion, opisthion, auricular point and sub-orbital point were found
in the usual way. They also provide the mean values of the angle βNγ. The posi-
tion of the bregma is then given by the means of the Nβ chords available for
15 male and 12 female crania. This line, extended where necessary, is used as
an accessory base line and it is divided into the tenths and other divisions indicated.
The tip of the anterior nasal spine (NS), the alveolar point, the anterior (p) and


* Op. cit. Appendix XI.





TABLE VI. Naga Sagittal Contours. Mean Values*.


























posterior (p′) extremities of the palate bones, the sphenobasion and basion all have
their x-co-ordinates measured from N along Nβ and their y-co-ordinates perpen-
dicular to this line. In order to obtain additional points in the pre-maxillary region
it is also necessary to draw a line through the alveolar point parallel to Nβ,
ordinates being taken from it at distances of ^ th and £th of Nβ from the alveolar
point.
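
The use of Nβ as an accessory base line amounts to a change of axes, which can be sketched as follows. This is an illustration only: the sign convention for the perpendicular direction, the function names and the numerical positions are all assumptions, and none of the figures are taken from Table VI.

```python
import math


def accessory_axes(nasion, bregma):
    """Unit vectors along and perpendicular to the nasion-bregma chord, and its length."""
    dx = bregma[0] - nasion[0]
    dy = bregma[1] - nasion[1]
    length = math.hypot(dx, dy)
    along = (dx / length, dy / length)
    # Assumed convention: the perpendicular is 'along' turned through +90 degrees.
    across = (-along[1], along[0])
    return along, across, length


def to_drawing_plane(nasion, bregma, x, y):
    """Place a point given by x along the chord and y perpendicular to it."""
    along, across, _ = accessory_axes(nasion, bregma)
    return (
        nasion[0] + x * along[0] + y * across[0],
        nasion[1] + x * along[1] + y * across[1],
    )


def point_on_parallel(start, nasion, bregma, fraction):
    """Point on the line through 'start' parallel to the chord, at a fraction of its length."""
    along, _, length = accessory_axes(nasion, bregma)
    return (start[0] + fraction * length * along[0],
            start[1] + fraction * length * along[1])


# Hypothetical positions in millimetres on the drawing board.
nasion = (0.0, 0.0)
bregma = (0.0, 100.0)
example_point = to_drawing_plane(nasion, bregma, x=50.0, y=10.0)
print(example_point)
```

With the chord placed along the second axis, as in this hypothetical case, a point with x = 50 and y = 10 falls half-way along the chord and 10 mm to one side of it; which side depends on the sign convention assumed above.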

The type sagittal contours constructed in this way are based on very small 
numbers of crania and no detailed comparisons with other types would be justified. 
Their most peculiar characteristics are the smooth and almost vertical sections of the 
frontal bones above the nasion (the male figure having no distinguishable glabella
prominence) and the lack of projection of the sections of the nasal bones. These
are characters which we should expect to find in the case of an Oriental type. 
A comparison with the type contours provided for other Oriental series shows that 
all their frontal and nasal sections are remarkably alike. The Tibetans A and 
Nepalese differ from the Nagas, Chinese, Burmese, Javanese, Dayaks and Tagals in 
having more projecting nasal bones. 

DESCRIPTION OF PLATES. 

I. Normal males (top row), normal females (middle row) and juveniles (bottom row) from the Naga
crania collected in 1927. The Biometric Laboratory numbers of these specimens, reading from left to
right, are: top row 37, 38, 44 and 18, middle row 4, 28, 12 and 40, bottom row 6, 16, 0 and 48. No. 37
has a large healed wound on the right side of the frontal bone.

II. Typical male Naga skull (R.C.S. 6.6231). Norma facialis (ca. 0.8 natural size).

III. Typical male Naga skull (R.C.S. 6.6231). Norma lateralis (ca. 0.68 natural size).

IV. A. Typical male Naga skull (R.C.S. 6.6231). Norma verticalis (ca. 0.7 natural size).

B. The nasal bridge of a male Naga skull (R.C.S. 6.6232) showing wormian bones in place of
the nasal bones (ca. 2.0 natural size). The nasal bridges of two African negro (Teita) crania having the
same anomaly are shown in A Study of the Negro Skull (Biometrika, Vol. XXIII (1931)), Plate IV
B and C.

C. The palate of a male Naga skull (1927 series, B.L. No. 44) showing the unerupted third left
molar horizontal and preventing the second molar from erupting (ca. natural size).





THE ALBANIANS OF THE NORTH AND SOUTH 

(1) INTRODUCTORY ACCOUNT OF MEASUREMENTS 
AND PHOTOGRAPHS TAKEN IN 1929. 

By MIRIAM L. TILDESLEY. 

The Albanian people, whose tenancy of the Balkans is said to extend farther 
back into the mists of antiquity than that of any other Balkan race, and whose 
language — save for borrowings — has no affinities with other European tongues, is 
divided into two groups, Gege in the North, Toske in the South. The question at 
once suggested to the physical anthropologist is whether the difference of name 
covers also a difference of type. The measurements and profile photographs which 
form the basis of the statistical study which follows this paper were collected in an 
attempt to furnish some sort of answer to this question, the subjects being therefore 
chosen in approximately equal numbers from North and South. 

Light upon the physical characters and relationships of the Albanian people was 
however not the only, nor even the original object in making these investigations. 
They were prompted also by a quite different purpose. And since the circumstances 
which gave rise to this other purpose affect the records themselves, it will be necessary 
to explain them briefly here. 

In 1928, impressed by the enormous waste of effort and opportunity involved con-
tinually in the making of anthropological records which are to a large extent non-com-
parable among themselves, the writer published a paper entitled “Racial Anthropo-
metry: A plan to obtain International Standardization of Method”*. In the course
of it reasons were given for supposing that the technique elaborated by Professor 
Rudolf Martin probably formed the most hopeful starting-point from which to build 
up an internationally acceptable technique. The reasons given — the chief of which 
was the wide following already obtained by the Martin school — were general: they 
included none based upon first-hand experience of the technique by the writer 
herself. As such experience would clearly be useful both in subsequent discussions 
of the problem and in estimating the reliability of records made by others in 
the field, she took an early opportunity of obtaining some instruction in the 
technique and then of practising it under field conditions. Both the amount of 
time spent in study and that available for field work were unavoidably restricted 
owing to various circumstances. 

* Journ. Roy. Anthrop. Inst., Vol. LVIII (1928), p. 351. [The Editor feels bound to state that in
publishing this account he has not overlooked the humour of this explanation of the writer’s purposes.]




The limited amount of training must be clearly realised: it would be most 
unfair to her instructors to pose as a fully trained anthropometrist of the Martin 
school. The anthropological training given at the Anthropologisches Institut in 
Munich extends over several years and includes a far longer and more thorough 
practice of anthropometric technique than it was possible to crowd into the three 
brief weeks of her stay there. She has to record her deep gratitude to Professor 
Theodor Mollison and Dr Wilhelm Gieseler for the pains they took to make this
short stay yield the maximum of profit. Intensive anthropometric instruction was
given by Dr Gieseler to a class of two, the other student being Mr R. H. Post,
whose visit to Europe has enabled him to record in detail some of the differences 
between the techniques employed by various teachers of anthropometry in Europe*. 
Practical instruction in photography was given by Professor Mollison; and advice 
as to instruments and equipment by both. The writer cannot forbear to express 
once more her grateful acknowledgment of the time and effort so generously 
expended. At best, however, it could be but three weeks of preparation, and the 
question arises as to how far one can rely upon the observations subsequently taken. 
It was part of the plan to get at any rate a measure of the observer’s own variability,
by making the observations on a series of individuals twice over on different occasions ; 
also, if possible, to test her personal equation against that of some other user of 
Martin’s technique by independent measurement of the same series. Unfortunately 
neither of these projects could be carried out on the return to England owing to 
the demands of work accumulated in her absence; and it would have been useless 
to do so after some months’ interval in which memory had lost its sharp outlines. 
There is no certainty that the personal equation established during a few weeks’ 
application of a technique will be found unchanged months afterwards. 

In this particular, therefore, execution fell short of the plan. And unfortunately 
it must be not merely admitted but emphasized that this failure robs her records 
of a good deal of their possible value. The fact that most other published measure- 
ments on the living body suffer from the same defect does not help matters. No 
measurement on the living can be taken at its face value as representing absolutely
the size of a given character in a given person, for this will have a certain
range of inherent variation as well as variation over which the observer has some 
(but not absolute) control, namely the posture of the subject and the observer’s own 
identification of terminals, his accuracy of reading and his ability to perform the same 
movements and pressures repeatedly in exactly the same way. From the standard 
deviation of the difference between pairs of observations by himself on a series one 
could work out the value of the standard observational error for any of his constants; 
similarly one could evaluate and allow for the effect of difference of personal equation 
between pairs of workers. Given enough of these records, one could fall back upon 
the most probable value forecast by them in cases where no direct measure of the 
difference was available for the pair whose measurements it was desired to pool or 
compare. It is strongly suspected that the combination of standard error, standard 

* Anthropologische Messungen am lebenden Menschen, Handbuch d. biolog. Arbeitsmethoden, Bd.
«n (1931), S. 461.





observational error, and personal-equation difference would frequently be found to 
mask actual inter-racial differences for many characters that anthropologists now 
measure; and when to these factors we add a difference in the actual definitions given 
in the different techniques employed, the unreliability of face-value differences in 
the results obtained is of course greatly increased. If the systematic recording and 
publishing of standard observational errors and personal-equation differences 
should result, as it probably would, in the abandonment of a good number of observa- 
tions now taken on the living, the elimination of these futile and time-wasting 
measurements need not be regretted, even if they exposed the worthlessness of 
results obtained by much hard work in the past. The past cannot be recalled, but 
the future is ours. 
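
The calculation proposed above, of a standard observational error from the differences between repeated observations, may be sketched as follows. The sketch assumes two readings of the same character on each subject by one observer, with independent errors of equal variance, so that the standard deviation of the differences is the square root of two times the error of a single reading. The function name and the figures are hypothetical illustrations and form no part of the writer's records.

```python
import math
import statistics

def observational_error(first, second):
    """Standard error of a single reading, estimated from paired repeat readings.

    Assumes both readings of each subject carry independent errors of equal
    variance, so that Var(difference) = 2 * Var(single-reading error).
    """
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two equally long series of at least two readings")
    diffs = [a - b for a, b in zip(first, second)]
    sd_diff = statistics.stdev(diffs)  # sample S.D. of the paired differences
    return sd_diff / math.sqrt(2)

# Hypothetical repeat statures (cm.) of five subjects, each measured twice.
first_reading = [163.2, 170.1, 158.9, 166.4, 172.0]
second_reading = [163.6, 169.8, 159.3, 166.1, 172.5]
error = observational_error(first_reading, second_reading)
print(f"standard observational error = {error:.2f} cm.")
```

The same function applied to readings by two different observers would confound observational error with any difference of personal equation; the mean of the differences, rather than their scatter, would be the natural measure of the latter.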

The actual measurements recorded by the writer, therefore, have only an un- 
certain value. Since it is now impossible to give any exact measure of their 
reliability the best she can do is to record certain impressions and experiences. Also, 
she is not the only worker who has used a technique without adequate training, 
and done so under the difficulties experienced in the field. Some in fact do so having 
had none at all; and others who have had as much as she may not have put it into 
practice fresh from instruction, as she did; also their field conditions may easily 
have been more difficult. It is probable therefore that her comments will apply in 
part at any rate to other published figures than merely her own. Before detailing 
these, however, she must explain what the field conditions were, and what subjects 
were measured. 

The subjects on which observations were made in 1929 were soldiers in Albania's 
conscript army. Their nominal ages were 21 and 22 years, but these figures are 
among those that cannot be taken at their face value, for two reasons. One is that 
even many of the more educated Albanian townsfolk have not up till now been ac- 
customed to keep any record of their ages; still less the uneducated villagers whom 
she measured. Their exact ages, therefore, were not known, and were only imputed 
to them in the army records. Secondly, conscription having been only very recently 
instituted, many young men were drafted into the army who would have been taken 
some years sooner if a regular army had come earlier into existence : it was quite 
obvious that in some cases the men were nearer thirty than twenty, and a few of 
them probably on the far side of thirty. They were drawn, as has been said, in 
roughly equal numbers from North and South. The actual districts were largely 
determined by the number of men from each present at that time in the regiment 
at Shkoder (Scutari). It was hoped at first to measure the Geges of Dibra. This 
choice had been suggested by a great authority on Albania, Miss Edith Durham; 
also, whenever the subject of stature or physique came up in Albania itself, the 
remark was generally made “You should see the men of Dibra. They are tall, fine 
Albanians.” As however there were not enough men of Dibra in the regiment, the 
Albanian prefecture of Kukes* was chosen, lying to the north and north-west of 

* This prefecture is part of the extensive district extending much to the north of the present 
boundary of Albania which under Turkish dominion was called Kossovo. It has nothing to do with the 
town of Kossova in the southern area. The name is still in use though not marked on modern maps. 






Dibra, and occupying the north-east corner of the present Albanian kingdom. The 
district is shaded in on the accompanying map, which similarly shows, in the South, 
the area from which the Southern Group was drawn, lying almost entirely within 
the prefecture of Gjinokaster (Argyrocastro). The larger towns were avoided as 
being perhaps more likely to be affected by racial mixture. Also Albanians of the 
North being partly Moslem, partly Catholic, with Catholicism as the religion of the 
southern Serbs; and those in the South being either Moslem or Orthodox, with 
Orthodox neighbours over the Greek border, some preference was given for Moslems 
in both districts, again with the object of avoiding as far as possible racial mixture. 
Among the Albanians themselves, the difference of religion corresponds to no 
difference of original stock, but to the subsequent accidents of conversion some 
centuries back. The last large-scale conversion was that to Mohammedanism, im- 
posed by force, under the Turk. Different communities may share the same village, 
but more often inhabit different ones in the same district. Inter-community marriages 
and individual conversions are both rare; the latter are said to be discouraged 
almost as much by the community joined as by the community abandoned. 

The individual sheets of observations record not only the man’s own village 
(which was of course that of his father and probably of all his male progenitors for 
generations back), but the distance between it and his mother’s village. It was 
interesting to discover that the men of Kukes tend to go rather farther afield for 
their wives than the men of Gjinokaster. Whether this can be associated, either as 
contributory cause or part effect, with the reputed better physique of the men in 
the North is a matter for speculation. The following are the distributions obtained, 
distance being measured by number of hours’ walk : 

          Same     ½    1    1½    2     2½    3     3½    4     5     6     7     8     9     10    11    12    over 

Group     village  hr.  hr.  hrs.  hrs.  hrs.  hrs.  hrs.  hrs.  hrs.  hrs.  hrs.  hrs.  hrs.  hrs.  hrs.  hrs.  12 hours  Total 

Northern 74339891628262204 3 78 

Southern 45 8767141223 0 001 87 

It may be suggested that the villages are perhaps farther apart in the Kukes pre- 
fecture. Such may be the case : the most detailed map available* certainly gives that 
impression, but this may perhaps be accounted for by the northern villages being on 
the whole smaller and fewer of them recorded. Information was obtained concerning 
the size of the villages from which 36 of our Geges (Northerners) came, and it gave 
a mean population of 393; for 47 of the Toskes (Southerners) the mean was 605†. 
Presumably the means for the whole of our two groups would still show the average 
Gege village to be smaller than the average Toske, and thus contribute one reason 
why the Gege should need to go farther afield to seek his bride, being bound to 
avoid marriage with any woman whose male line of descent was known to include 

* In M. Justin Godart’s L’Albanie en 1921. 

† Data obtained from the 1927 census, published in Tirana, 1926, entitled Shqipria më 1927. It 
was not possible to identify in all cases the villages entered on the measurement slips with those 
published in the census, owing to variations in the spelling. The writer wishes to express her thanks 
to His Excellency Djemil Bey Dino, Albanian Minister in London, for his kind help in the identification 
of the districts in question and for the loan of the book quoted. 




an ancestor in his own male line. This does not seem, however, to be the whole 
explanation, for if we confine our analysis to those men who came from the larger 
villages, with over 350 inhabitants, we find that out of 19 Geges the mothers of 
only two had come from the father's village; out of 37 Toskes from the larger 
villages, 27 had both parents from the same village. Only three Geges came from 
towns numbering more than a thousand inhabitants, and two mothers out of the 
three came from elsewhere. Ten Toskes came from towns of this size, and the 
mother of only one was brought in from outside. 

Without the privilege of access to the army and the assistance and facilities so 
courteously accorded it would have been impossible to obtain in the few weeks 
available the measurements and photographs which were eventually brought home, 
and the writer wishes to express her most grateful thanks for the great kindness 
she experienced in Albania. It may be imagined how different would have been 
the rate of progress if it had been necessary to waylay individual villagers as they 
came to the Scutari market, and to make to them the outrageous proposition that 
they should submit to the indignity of being handled, clad only in shorts, by one 
of the subject-sex. The reason alleged being incomprehensible and absurd, the real 
reason would probably be sinister, and in any case would be suspected. The results, 
if any, obtained in these conditions would have been relatively few and expensive. 

For the much more favourable conditions in which the work was eventually 
carried out the writer must first of all express her indebtedness to her friend 
Mr Qazim Kastrati, whose help, suggestions, and initiative throughout she most 
gratefully records, and in whose family she was privileged to enjoy the wonderful 
experience of Albanian hospitality. Military photographs being forbidden, no less 
authority than that of the Prime Minister was needed to override this ruling. 
Through the kind instrumentality of the British Consul and the British head of 
gendarmerie she was able to state her request in person to the Premier, His 
Excellency Koço Kota, and to return from the capital to Scutari armed with the 
necessary instructions. Finally, she wishes to express her thanks to the Commandant 
of the regiment, and to Adjutant Ibershimi who, being detailed to provide the men, 
accommodation and facilities required, carried out these tasks with great courtesy 
and good will. 

The field conditions were thus much easier than in some cases, but yet certain 
difficulties existed which doubtless had their effect upon the records. These were 
chiefly difficulties of language and of time. The regiment had to depart for Tirana 
three weeks after the work was begun, in order to take part in the celebration 
of the first anniversary of the accession of H.M. King Zog. The effective part of 
these three weeks was shortened by delays which sometimes occurred, soldiers 
being required for other duties than the duty of being measured. As my friends 
Mr Qazim Kastrati and Mr Teufik Kalatsi were only able to assist part of the 
time by recording the measurements — for which I thank them— the work had to 
be done in part with the assistance of two sergeants who had a smattering of 
English or French (too imperfect to remove all danger of mistakes) and partly 





with the help of a corporal speaking only Albanian. In the latter case she either 
both measured and recorded, or measured and dictated the figures in Albanian. As 
she was not well, and the heat very great, and as the hours worked were long, fatigue 
was not absent ; and when fatigue supervenes, the translation of figures into unfamiliar 
words is neither so fluent nor perhaps so accurate as it would otherwise be. It is 
probable also that observations made in these conditions may themselves be less 
accurate, in spite of every endeavour to keep them up to standard. The extent 
to which such factors are likely to modify the results remains unknown ; but as, 
presumably, errors due to these causes are as likely to be in excess as in defect 
of the real values, the means at any rate may be very little affected. 

To come now to particular measurements : the values for chest girth when the 
lungs are fully inflated or fully deflated can certainly not be relied upon. Complete 
deflation was sometimes obtained by making the soldier laugh; the tape was held 
with one hand, while his ribs were tickled with the other and the Albanian 
word for “ Laugh ” uttered. Whether his ribs or his sense of humour were more 
tickled by this procedure is not certain, but in most cases it was immediately 
effective and both hands were then quickly used to tighten the tape round the 
collapsed chest. Sometimes, however, his gravity was portentous, and the lungs 
far from collapsed. Deliberate inflation and deflation being quite new to him, 
neither for the most part was performed very successfully; and though normal 
chest girth was usually the best measurement of the three, there was a tendency 
to inflate somewhat as soon as the tape was passed round the chest. Where 
operator and subject have only a few words of any language in common and time 
is limited, it must always be very difficult to make these measurements com- 
parable with those taken, say, by Germans on Germans who have been trained 
to breathing exercises from their youth. The “ balling ” of the upper arm muscle 
seemed on the other hand to be understood and performed much better, so that 
maximum upper arm girth is probably fairly reliable. So also is span. The closing 
of the teeth for measurement of total Face Height almost always offered some 
preliminary difficulty. The investigator is not underhung, but the soldier always 
responded to the demonstration of her own closed teeth by shooting out his lower 
jaw in front of the upper ; and so convinced was he that he had to do something 
unusual with his lower jaw that it often took several moments of voluble ex- 
planation and demonstration by the corporal in charge and by those soldiers who 
had already been through the process, assisted by physical force, before his 
bewildered mandible could be got into the correct position. Having got it there, 
however, he usually kept it well clenched, and the operator's impression is that 
the Face Height measurement was fairly well done. The identification of the nasion 
in the living is notoriously open to wide differences of practice, but she felt few 
doubts about her practice of the instructions received on this point. It was 
otherwise with the identification of the terminal for measuring Knee Height, and 
this character was therefore abandoned half-way through. Also with that for 
measuring Ankle Height, though this was not abandoned. Those projective measure- 
ments which depend upon posture — height from the ground of shoulder, elbow, 




wrist, top of breast bone, etc. — are well known as being very subject to observa- 
tional error, error which will, however, be less with the very experienced observer. 
As she was not so very experienced, these must be to some extent suspect in her 
case, though she was not conscious of any particular misgivings while taking them. 
Stature seems likely to be the most reliable of these, as it is easier for the subject 
to assume the correct position than to maintain it for some time. Martin's method 
of taking Head Height was regarded by her instructors as unsatisfactory and liable 
to very considerable error : the results for this character must therefore doubtless 
be accepted with caution. Head Length and Breadth, on the other hand, would be 
among the most reliable measurements taken. How far the misgivings or con- 
fidence here expressed are justified, and how far any errors affected the results 
cannot now be tested except in so far as the comparative measurements available 
may tend to confirm them. Where they do not, the cause may not lie in the 
writer's departure from the technique she was attempting to apply, but in the non- 
conformity of the other observers' practice with that of her instructors. As far, 
however, as comparison of Gege with Toske is concerned, the same conditions 
apply equally to both. They were not measured at different times, but Gege and 
Toske interspersed, whether for measurement or photography. 

The profile photographs from which Albanian type silhouettes have been 
obtained were taken, not in the barracks, but in the rooms of a local photographer, 
who was good enough to allow the use of his premises, to assist by changing the 
plates in the holders, and afterwards to do a good deal of the developing. His 
possession of a limited amount of German, acquired during the Austrian occupation 
in 1914 — 1918, gave us a medium of intercourse, up to a point, but though he 
could read figures he was unfortunately unable to write them. This presented 
a considerable difficulty in that the plates removed from the holders had each 
to be marked with the soldier's number, for identification. At my suggestion he 
represented each numeral by an appropriate number of scratches on the plate, 
though the noughts baffled him and were attempted in various ways — sometimes 
by ten strokes. This device, however, was successful as regards nine-tenths of the 
photographs ; those in which the counting of the strokes as they were made was 
wrong, offered a task of some patience in identifying the men, but with the help 
of the measurements recorded all were identified in the end save two or three. 
The other difficulties attendant on the photography were those of time shortage 
(they were taken towards the end of the three weeks allotted) and the job of 
coping, often single-handed, with batches of a score of men who felt livelier out 
of barracks, of posing the one and keeping him motionless till photographed and 
at the same time preventing some of the others from interfering with the photo- 
grapher's property. Sometimes there was the complication of a client for the 
photographer, before which all else had to give way, and one's own camera and 
properties to be shifted. A standard distance between camera and subject was 
aimed at throughout and a standard focus ; these were kept as exact as possible 
in the rather unplacid circumstances. It is hoped that the photographs, thanks 
to the labour spent upon reducing them to type silhouettes, may form a definite 




addition to our knowledge of the Albanian head. One omission, however, has 
robbed them of some of their value, and for this the writer must now publicly 
don the white sheet which has long been her garb in the Biometric Laboratory 
whenever the subject of the Albanian photographs has been raised there by the 
Editor of this Journal. She omitted to take any direct measurement on the head 
which would give exactly the scale of the photograph. For this reason, though 
the Albanian type silhouettes give the proportions of the type head, they fail to 
give their exact sizes, thus preventing exact comparison with other type silhouettes. 
The omission was due to no other circumstance than her own failure to realise the 
necessity of such a measurement. 

It remains to record the result of certain observations on the back of the head 
and on the teeth. The majority of the heads examined were distinctly flat-backed, 
but a note was made of those that seemed “round,” with a roundness that would 
be seen more in N. verticalis than in N. lateralis. Thirty-three out of 77 men from 
the North are thus recorded, as against six out of 89 from the South: the difference 
is significant. Occipital asymmetry was also noted, and found to occur in eight 
out of the 77 Kukes men (three on L., five on R.), and in 15 out of 89 from 
Gjinokaster (three on L., 12 on R.); the difference is non-significant. As regards the 
teeth, edge to edge bite was observed in 16 out of 78* from Kukes, and in 14 out 
of 89 from Gjinokaster. Obviously-irregular dentition was noted in only two 
out of each group — would the same were true of our own countrymen! Only 
two were underhung, and only one had open bite, these three being all from 
Kukes. 
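
The text does not state by what test the two differences above were judged. A minimal sketch of one common large-sample check, the difference of two proportions divided by its standard error, is given here on the counts quoted; the use of the pooled proportion for the standard error, and the function name, are assumptions of the sketch and not a record of the writer's procedure.

```python
import math

def proportion_difference(k1, n1, k2, n2):
    """Difference of two proportions divided by its standard error.

    The standard error is estimated from the pooled proportion, the usual
    large-sample approximation; this choice is an assumption of the sketch.
    """
    p1, p2 = k1 / n1, k2 / n2
    pooled = (k1 + k2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    return (p1 - p2) / se

# Counts quoted in the text, North first: "round" heads, then occipital asymmetry.
round_heads = proportion_difference(33, 77, 6, 89)
asymmetry = proportion_difference(8, 77, 15, 89)
print(f"round heads: {round_heads:.1f} standard errors")
print(f"occipital asymmetry: {asymmetry:.1f} standard errors")
```

On this reckoning the difference in "round" heads exceeds five times its standard error, while that in occipital asymmetry is little more than once its standard error, which agrees with the verdicts recorded above.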

The scientific results of the Albanian Expedition were thus twofold : on the 
one hand a gain in experience which the writer feels to be very much worth while 
to herself, and on the other hand these records, imperfect in some respects, con- 
cerning the physical characters of Gege and Toske. She cannot sufficiently express 
her gratitude to Professor Karl Pearson for the very great labour bestowed by 
the Biometric Laboratory on these records and for his kindness in permitting them 
to take a place, albeit a modest one, in Biometrika. She is greatly indebted to 
Dr Morant for the loan of his camera and telephoto lens, and to him and to 
Miss N. Karn for the statistical treatment of her measurements and for the 
reduction of the silhouettes to types ; also to the draughtsmen, Miss E. Irvine and 
Miss M. Kirby, of the Biometric Laboratory for the large amount of drawing work 
involved. 

(2) DISCUSSION OF MISS M. L. TILDESLEY'S MEASUREMENTS 
ON THE NORTHERN AND SOUTHERN ALBANIANS. 

By the Staff of the Biometric Laboratory. 

An examination of Table I shows that the group of men studied by Miss Tildesley 
from the North is differentiated in essential characters from the group drawn from 

* Not quite all the men were available for the entire series of observations and again for photography, 
hence the variation in numbers. 




the South. Taking the differences of absolute body sizes — North minus South — we 
find the deviations in terms of the probable errors of those deviations are for: 

Stature: 7.7; Span: 7.8; Sitting Height: 3.4; Suprasternal Height: 6.9; 
Acromial Height: 7.4; Elbow Height: 7.3; Wrist Height: 6.7; Finger Height: 
5.6; Foot Length*: 12.6; Foot Breadth: 4.0; Chest Breadth: 3.1; Chest Depth: 
4.1; Chest Girth at rest: 5.0; Chest Girth inflated*: 7.0; Hip Breadth: 4.5; 
Waist Girth: 3.4; Upper Arm Girth (straight): 3.9, (flexed): 5.3; Head 
Circumference: 6.1. 

In no case was the Southern Group greater in a bodily measurement than 
the Northern Group. Clearly the man of the Northern Group is in nearly all 
respects significantly a larger and more muscular being than the man of the 
Southern Group. 
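
The ratios listed above can be reproduced from the means and probable errors of Table I. The sketch below assumes the two groups to be independent, so that the probable error of a difference is the root of the sum of the squares of the two probable errors; the formula is not written out in the text, and the function name is hypothetical, but the rule returns the printed figures for the two characters tried.

```python
import math

def ratio_to_probable_error(mean_a, pe_a, mean_b, pe_b):
    """Difference of two means in units of the probable error of the difference.

    Treats the two groups as independent, so the probable errors combine
    as the root of the sum of their squares.
    """
    return (mean_a - mean_b) / math.hypot(pe_a, pe_b)

# (North mean, North P.E., South mean, South P.E.) as printed in Table I, in cm.
span = ratio_to_probable_error(175.58, 0.630, 169.32, 0.495)
foot_length = ratio_to_probable_error(25.97, 0.098, 24.30, 0.089)
print(f"Span: {span:.1f}; Foot Length: {foot_length:.1f}")
```

A ratio of three or so is the usual threshold on the probable-error scale, since the probable error is 0.6745 of the standard error.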

Turning now to the facial measurements the Minimum Frontal Breadth, Face 
Heights, Nasal Measurements, Orbital Measurements and Ear Diameters show no 
significant differences. There is no distinction in Neck Girth or Bi-zygomatic 
Breadth. Turning to the head measurements we find that the excess of Head Length 
in the Northern over the Southern Group is 14.7 times the probable error of the 
difference, and the defect in breadth of the former is 8.0 times the probable error of the 
difference. There is no significant difference in the Auricular Heights. 

These racial distinctions are well illustrated in the three cephalic indices, the 
Breadth-Length Index of the Southern Group being probably the highest known. 
Such an index may, of course, occur in individuals, but as a racial mean it is of the 
rarest occurrence. Again the Height-Length Index of the Northern Group is 
remarkably low. The Fronto-Mandibular Indices are almost certainly significantly 
different, although it is not possible in the absence of the standard deviations to 
assign the degree of significance of any of the indices. No other indices show 
differences of obvious importance. 

The variability as measured by the standard deviations shows nothing like the 
same differentiation, the highest ratios of difference to probable error being in Hand 
Breadth (3.5) and Foot Breadth (4.2). This is in accordance with the general 
experience that racial differences are usually those of type rather than those of 
variability. 

We may take it as proven that the Northern Albanians do differ substantially 
from the Southern in bodily size and, it would appear, also in head-shape. But the 
high value of the first cephalic index in the Southern Group together with certain 
vague rumours have led to statements that the Albanians distort the heads of their 
children. Thus Eugène Pittard states that the variability of the Albanian head 
measurements “peut aussi provenir des pratiques de déformations qui sont loin 

* These measurements are of difficult accuracy, but there could be no bias between the two groups, 
as they were measured on the same spot in the same manner as the individuals occurred on the muster 
roll. 



TABLE I

(Measurements in centimetres.)

Absolute Measurements

                                      Albanians of the South                 Albanians of the North
        Character                     No.  Mean ±            Stand. Dev. ±   No.  Mean ±            Stand. Dev. ±
                                           Prob. Error       Prob. Error          Prob. Error       Prob. Error

(1)     Stature                       86   163.72  ± .449     6.14  ± .318     77   169.03  ± .520     6.68  ± .374
(23)    Sitting Height                86    87.94  ± .275-    3.73  ± .194     77    89.19  ± .246     3.20  ± .174
(17)    Span                          85   169.32  ± .495     6.76  ± .350     77   175.58  ± .630     8.20  ± .446
(4)     Suprasternal Height           84   133.58  ± .388     5.28  ± .275-    77   137.91  ± .493     6.42  ± .349
(8)     Acromial Height               86   133.29  ± .400     5.47  ± .283     77   137.86  ± .472     6.14  ± .334
(9)     Elbow Height                  85   102.80  ± .318     4.35+ ± .225+    76   106.33  ± .363     4.69  ± .267
(10)    Wrist Height                  86    78.43  ± .255+    3.49  ± .180     77    81.03  ± .290     3.77  ± .205+
(11)    Finger Height                 85    60.93  ± .235-    3.21  ± .166     77    62.84  ± .245-    3.18  ± .173
(36)    Shoulder Breadth              85    37.06  ± .121     1.65+ ± .086     75    37.48  ± .138     1.77  ± .098
(40)    Hip Breadth                   85    27.65+ ± .110     1.50  ± .078     77    28.33  ± .102     1.33  ± .072
(36)    Chest Breadth                 85    25.63  ± .096     1.31  ± .068     77    26.05+ ± .097     1.26  ± .069
(37)    Chest Depth                   85    19.18  ± .081     1.11  ± .058     77    19.70  ± .095     1.23  ± .067
(62)    Hand Breadth                  85     8.39  ± .035      .47  ± .024     77     8.49  ± .028      .36  ± .020
(58)    Foot Length                   85    24.30  ± .089     1.21  ± .063     77    25.97  ± .098     1.27  ± .069
(69)    Foot Breadth                  85    10.05- ± .048      .66  ± .034     77    10.29  ± .037      .48  ± .026
(61)    Chest Girth: at rest          85    87.81  ± .264     3.61  ± .187     77    89.79  ± .295+    3.84  ± .209
(61*)   Chest Girth: inflated         85    90.96  ± .272     3.72  ± .193     77    93.85+ ± .314     4.08  ± .222
(61")   Chest Girth: deflated         85    85.67  ± .284     3.89  ± .201     77    86.62  ± .262     3.40  ± .186
(62)    Waist Girth                   85    72.24  ± .304     4.15+ ± .215-    77    73.63  ± .278     3.62  ± .197
(66)    Upper Arm Girth (straight)    85    24.53  ± .107     1.46  ± .076     77    25.14  ± .117     1.53  ± .083
(65 1)  Upper Arm Girth (bent)        85    27.79  ± .127     1.73  ± .090     77    28.78  ± .138     1.80  ± .098
(66)    Lower Arm Girth               85    25.40  ± .099     1.35- ± .070     77    25.74  ± .095+    1.24  ± .067
(69)    Calf Girth                    85    33.46  ± .152     2.08  ± .107     76    34.03  ± .136     1.76  ± .096
(63)    Neck Girth                    84    35.47  ± .102     1.39  ± .073     77    35.76  ± .112     1.45  ± .079
(1)     Max. Head Length              85    17.70  ± .042      .58  ± .030     77    18.65  ± .049      .64  ± .035
(3)     Max. Head Breadth             84    16.07  ± .029      .57  ± .030     77    15.65  ± .044      .57  ± .031
(15)    Auricular Height              85    12.17  ± .039      .54  ± .028     77    12.10  ± .044      .57  ± .031
(4)     Min. Frontal Breadth          84    10.93  ± .029      .40  ± .021     77    10.87  ± .030      .39  ± .021
(6)     Bi-zygomatic Breadth          83    14.10  ± .036      .48  ± .025+    76    14.07  ± .039      .50  ± .028
(8)     Mandibular Breadth            84    10.69  ± .037      .50  ± .026     76    10.87  ± .041      .53  ± .029
(45)    Head Circumference            85    55.15  ± .106     1.45+ ± .075     77    56.07  ± .107     1.39  ± .076
(17)    Hair Line to Chin             85    17.82  ± .064      .88  ± .046     76    17.98  ± .065+     .85  ± .046
(18)    Nasion to Chin                84    11.90  ± .042      .57  ± .029     77    12.05  ± .046      .61  ± .033
(21)    Nasal Height                  85     5.64  ± .024      .33  ± .017     77     5.64  ± .033      .42  ± .023
(13)    Nasal Breadth                 85     3.44  ± .016      .22  ± .012     77     3.41  ± .018      .23  ± .012
(22)    Nasal Depth                   84     1.71  ± .017      .23  ± .012     77     1.71  ± .017      .22  ± .012
(16)    External Ocular Distance      85     8.78  ± .020      .39  ± .020     76     8.75+ ± .027      .34  ± .019
(9)     Internal Ocular Distance      84     3.24  ± .017      .24  ± .012     77     3.26  ± .018      .23  ± .013
(12)    Pupillary Distance            85     6.42  ± .023      .32  ± .016     77     6.41  ± .022      .29  ± .016
(29)    Ear Length                    85     6.16  ± .028      .38  ± .020     77     6.18  ± .028      .37  ± .020
(30)    Ear Breadth                   85     3.69  ± .019      .26  ± .013     76     3.71  ± .019      .24  ± .013

Indices, found from the Ratio of Means only.

                                      Albanians of the South                 Albanians of the North
        Index                         No.  Index                             No.  Index

Sitting Height Index (23)/(1)         85    53.7*                            77    52.8*
Span Index (17)/(1)                   85   103.4†                            77   103.9†
1st Cephalic Index (B/L)              84    90.8                             77    83.9
2nd Cephalic Index (H/L)              85    68.8                             77    64.9
3rd Cephalic Index (H/B)              84    75.7                             77    77.3
1st Nasal Index (B/H)                 85    61.0                             77    60.5
2nd Nasal Index (D/B)                 84    49.6                             77    50.1
Fronto-Mandibular Index (4)/(8)       84   102.2                             76   100.0
Face Index (18)/(6)                   83    84.4                             76    85.6
Aural Index (30)/(29)                 85    59.8                             76    60.1


* English men at standard age 40.5 years: 52.7. 
† English men at standard age 40.5 years: 102.8. 

The numbers in brackets placed before the characters indicate the numbers in the corresponding 
sections of Martin’s Lehrbuch der Anthropologie, the measurements described therein being those taken 
by Miss Tildesley. The English names are not intended as descriptions of the measurement. They are 
those provided by Miss Tildesley in the list she gave the writer of means and standard deviations; they 
have been occasionally contracted to allow of the table being printed on one page, and occasionally 
slightly expanded for the sake of lucidity. 












d'être abandonnées par les populations de la Péninsule balkanique.”* Pittard 
gives no references to deformation among the Albanians, nor is it clear what he 
means by the variability of the head-measurements, for as judged by the standard 
deviations this is not outstanding in Tildesley's data. He quotes no standard devia- 
tions and is possibly only referring to the difference in type between North and 
South. 

Remarks on the above Table. Unfortunately when the above material was handed 
to the present writer for discussion, it was found that in the case of the 
41 absolute measurements, seven of the chief characters were only given as far 
as their means and standard deviations were concerned to two decimal places, but 
their probable errors to three. The remaining 34 were given to four decimals. To 
make the table uniform, the means and standard deviations are all given only to 
two decimal places, but the probable errors to three that there may be at least two 
significant figures in all cases. 

No means nor standard deviations of the indices were provided, so that all that 
it has been feasible to do was to take the ratio of the means of the absolute measure- 
ments. The absence of the chief index distributions is a loss not only because they 
would have provided further evidence of differentiation between the two groups, 
but because they would have allowed a wider comparison between Miss Tildesley’s 
measurements and those of other investigators to be made. The large amount of 
time devoted to the enlargement of the photographic profiles, the measurement of 
these enlargements, and the final reduction to type contours did not permit of 
further work by members of the Biometric Laboratory Staff on this material. 
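
The makeshift described above, an index formed as the ratio of two means, can be illustrated on the Southern Group's head measurements as printed in Table I. This is a sketch only, and the function name is hypothetical. It should be borne in mind that the ratio of two means is not in general the mean of the individual ratios, and that it yields no standard deviation, which is the loss the paragraph laments.

```python
def index_from_means(numerator_mean, denominator_mean):
    """Index (per cent.) formed as the ratio of two group means."""
    return 100.0 * numerator_mean / denominator_mean

# Southern Group means as printed in Table I (cm.).
head_breadth, head_length, auricular_height = 16.07, 17.70, 12.17

first_cephalic = index_from_means(head_breadth, head_length)        # B/L
second_cephalic = index_from_means(auricular_height, head_length)   # H/L
third_cephalic = index_from_means(auricular_height, head_breadth)   # H/B
print(f"{first_cephalic:.1f} {second_cephalic:.1f} {third_cephalic:.1f}")
```

The three values agree to one decimal place with the Southern entries for the cephalic indices in the table.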

It is extremely difficult to get definite information with regard to the treatment 
of infants in Albania. The present reporter applied first to Mrs M. M. Hasluck, well 
known for her travels in Albania. She most kindly made inquiries and wrote to 
him two letters which cannot be said to clear up the matter entirely but are 
instructive. From the first dated “Elbasan 23. xii. 30”† we make the following extracts 
touching this point: 

Albanian babies lie beside their mothers till the latter get up. Then they are kept 
permanently (night and day) in cradles ; the mother even lifting the cradle on to her knee to 
feed the child. In the cradles they lie lightly swaddled. 

As to head distortion, for years I’ve been chasing the story of babies being strapped on 
boards to make their heads flat. It has met me from Zagori, a Greek district, a little north of 
Jannina, the capital of Epirus, to Scutari in Albania. Of all the cradles I’ve investigated, and 
they are pretty numerous by now, not one was without its little mattress and its pillow. The 
average head from Zagori up through Albania has obviously an unusually flat back, and so far as 
my present knowledge goes there is no more in the story than that. The story of the boards in 

* Eugène Pittard: Les Peuples des Balkans. Recherches anthropologiques dans la Péninsule des 
Balkans, spécialement dans la Dobroudja, p. 980, Geneva, 1920. 

† No two authors, no two maps agree in the spelling of Albanian place names. There appears to be 
as yet no standardisation. Accordingly, we have thought it best to use throughout the spelling adopted 
by each author cited in discussing his work, notwithstanding the apparent resulting medley. The task 
of standardising must be left for philologists to dispute over; we are content to show how necessary it is. 



Discussion of M. L. Tildesley’s Data 


Albania may also be helped on by the curiously plank-like shape Albanian men (and many 
peasant women) give their faces by shaving the hair on their temples. 

Miss Durham, I believe, thought the story all foolishness. So far I have found no cases in 
Albania of the tight bandaging of the head practised in W. Macedonia and elsewhere as recorded 
in my notebooks. 

Some months later I received a second letter dated “Elbasan 11. iv. 31.” In it 
Mrs Hasluck says : 

I hope you will be interested to hear that I’ve run to earth the story that Albanian heads are 
flat at the back because their owners were strapped on purpose to boards when babies. Within 
the last weeks I have seen two new-born infants, both strapped to boards. But between each 
infant and its board there was always a mattress an inch thick, with a thicker pillow (of rags) 
under the head. Is that not enough to prevent the board from influencing the shape of the 
baby’s head? Both mattress and pillow are as thick as any used among the various populations 
of Macedonia, who do not strap their infants on boards. 

The women state that they strap the babies on the board to make their hold on life as strong 
as the board. This ignores the fact that the floor of the cradle is wooden and at least almost as 
thick as the board. Babies are strapped on the board till baptised, if the parents are of the 
Orthodox religion, and for forty days if the parents are Mohammedans, descended from Orthodox 
ancestors. For the moment I have no information about the Catholic Albanians, but hope to get 
some within a day or two. Nor have I any about Mohammedans descended from Catholic 
Albanians. 

The Orthodox qualify their practice, however. Baptism follows birth within a week as a rule, 
in case the baby dies, when its soul would be lost. They state that they put the baby in a cradle 
as soon as baptised. In practice, however, they wait till the moon is full, to ensure the baby’s 
living out the allotted span. I incline to think that the board is used as a worthless thing, on 
which the worthless, hardly human, unbaptised baby is most fittingly strapped. In the town of 
Elbasan, which is either pure Orthodox or Mohammedan ex-Orthodox, and has no wood, a tile 
is used instead of a board. 

Or, it may be that the board has survived from an earlier time when cradles were not known. 
Albanian infants are not swaddled round and round like Macedonian and something had obviously 
to be done to keep their backs straight. 

Some Albanians have rationalised the custom to me, saying that it has arisen because the 
mother’s father must present the cradle, and that cannot be bought till the child has been born,— 
counting your chickens before they are hatched being a dangerous procedure in Albanian opinion, 
not merely a foolish one as with us. This explanation ignores the fact that the grandfather 
presents a cradle only when the first child is born, and that the (generally numerous) children 
who follow are laid in the same cradle. Often, too, a cradle is preserved for several generations 
in the same family, and the grandfather is not required to present a new one. 

Head-moulding is as deliberately and as commonly practised as it was in Macedonia, and by 
the same means, a handkerchief tightly bound round the temples. ‘Who wants a head like an 
apple?’ they say. But they never associate the board with their attempts at head moulding. 

It is probable that we may safely dismiss the strapping to a board before 
baptism as a cause of the brachycephaly of the Albanian Southern head. An 
English medical woman who had studied gynaecology in Vienna told the present 
writer that in the lying-in hospital in Vienna the babies were invariably strapped 
to boards, apparently for the convenience of handling, and that the nurse would 
carry about four to six of these strapped babies at a time. Yet one does not find 
that the Austrian Germans are as a whole more brachycephalic than Southern 
Germans. 




The Albanians of the North and South 

The last paragraph of Mrs Hasluck’s second letter appears to indicate that the 
Albanians are willing to head-mould, but that their attempts must be singularly 
ineffectual. A tight bandage round the temples should achieve two results, (a) a high 
skull and (b) a round horizontal section. But the Albanian skull has a very 
low auricular height*, and its characteristic feature is that very apple-shape which 
the head-moulders are seeking to avoid — the Albanian head is the roundest head 
in Europe! The reported distortion of the Albanian head has probably little 
influence on the extreme brachycephaly of the Southern Albanians. It is not 
unlikely to be a post hoc explanation of a remarkable natural character. 

Accepting the standpoint that the Northern and Southern Groups studied by 
Miss Tildesley form in bodily and cephalic characters two distinct racial types, we 
may now consider how far her measurements are in accord with those of previous 
investigators. 

We may deal with the work† of Raffaello Zampa first. On S. 209 he gives 
measurements of 59 Albanian men living in Italy (Calabria, Cosenza). This Albanian colony 
is found in the centre of the toe of Italy, but we have no clue to the part of Albania 
from whence the men came‡. Zampa states that their mean stature was 164 cms., 
which agrees with that of Miss Tildesley for her Southern Group. He gives a few 
undefined measurements of which the most remarkable is the Head Breadth = 148 mm. 
No one, as far as we are aware, has suggested a lesser breadth than 155 mm. for 
any Albanian district and the average for 119 cases from various localities of 
Haberlandt and Lebzelter is 158 mm. The mean Head Length of Zampa’s 
Albanians is 183 mm., which agrees exactly with Haberlandt and Lebzelter’s pooled 
value for 119 men. Zampa gives for the first Cephalic Index the value 80.6 (80.7 
from his frequency distribution), a value far below anything suggested by later 
inquirers for any Albanian group. Thus it does not seem possible to lay any stress 
on Zampa’s figures. 

In 1897 Leopold Glück§ published a more important paper giving individual 
measurements of 30 male Albanians from the north of the country. Fifteen of these 
men came from Prizren, 11 from Djakova, 1 from Novi-Bazar and 1 from Ipek, 
these being their birthplaces (see map, p. 24). All these towns lie outside the 
present boundaries of Albania, although the first two of them are within 10–15 
kilometres of the boundary of Miss Tildesley’s Northern Group, and they were 
within the old boundary of Turkish Albania. Comparison, if any, must therefore be 

* “La hauteur du crâne (diamètre auriculo-bregmatique) est également faible chez les Albanais. 
Toutes les populations de la Péninsule des Balkans... excepté les Serbes, ont une hauteur du crâne 
supérieure à celle des Albanais... c’est le plus petit crâne examiné jusqu’à présent parmi les groupes 
brachycéphales de la Péninsule des Balkans. Il est d’autant plus nécessaire de souligner ce fait que la 
taille des Albanais est, en moyenne, une des plus élevées de cette région,” Eugène Pittard: Les Peuples 
des Balkans, p. 283, Geneva, 1920. 

† “Vergleichende anthropologische Ethnographie von Apulien,” Zeitschrift für Ethnologie, Bd. 
XVIII (1886), S. 167–193 u. S. 201–232. 

‡ See for the account of this settlement Norman Douglas: Old Calabria, London, 1920, pp. 151–192. 
The migration started in the 15th century with the advance of the Turks. 

§ “Zur physischen Anthropologie der Albanesen,” Wissenschaftliche Mittheilungen aus Bosnien und 
der Hercegovina, Bd. V (1897), S. 366–402. 





made with Miss Tildesley’s Northern Group. Unfortunately Glück gives no 
definitions of the measurements taken. 

It will be seen that Miss Tildesley’s Northern Group has in all directions a bigger 
bodied and larger type of man. The one exception is the Head Height, but we do 
not know what measurement Glück was taking, and whether it was from the centre 
of the ear passage, or again whether it was the vertical height or not. If it 
were from the centre, 8 mm. would be a quite reasonable difference. To Glück’s 
“Entfernung des Ohrloches von der Nasenwurzel” we shall return later; it is 
probably the measurement Miss Tildesley unfortunately omitted to take. 

                                   Glück (30 men)      Tildesley’s Northern
                                   (from his Table,    Group (77 men)
Character                          S. 374–75)          (from our Table, p. 31)

Comparable Characters.

Stature                            1684                1690.3
Span                               1704                1755.8
Chest Girth                         874                 897.9
Head Circumference                  553.5               560.75
Head Length                         183.5               186.5
Head Breadth                        153                 156.5
Bi-zygomatic Breadth                139                 140.7
Cephalic Index                       82.58               83.9
Internal Ocular Distance             31.0                32.6
Foot Length                         259                 259.7
Ear Length                           58.6                61.8
Nasal Height                         55                  56.4
Nasal Breadth                        32                  34.1
Nasal Index                          58.74               60.5

  Remarks. Of the twelve comparable absolute characters every one is greater in
  Tildesley’s Northern Group, when compared with Glück’s still more northerly
  series. The odds against this alone are 4095 to 1, if we neglect the only moderate
  correlations of the characters. The probability of the observed differences in
  Head Length and Head Breadth occurring in 12 samples from the same population
  is of the order .0004.

Characters measured by one author only.

Forehead Height                      60                  ..
  Remarks. Tildesley’s measurements would give 59.3 were it legitimate to subtract
  distance from gnathion to nasion from distance from gnathion to crinion.
Forehead Breadth                   [103]                108.7
  Remarks. There must be an error in Glück’s Table, as the value 103 does not
  agree with his value 109 on p. 371.
Hand Length                         187                  ..
“Entfernung des Ohrloches
  von der Nasenwurzel”              103                  ..
  Remarks. Glück’s measure is discussed on our pp. 48-49.

Characters non-comparable owing to definition or to personal equation.

Face Height                         184                 179.8
  Remarks. Must be due to definition, or to personal equation in determining
  gnathion.
Lower Face Height                   125                 120.5
  Remarks. Difference due to personal equation in determining nasion or gnathion,
  probably the latter. See remark to Forehead Height.
Face Index (=(0)/(17))               75.77               78.25
  Remarks. Difference follows from remarks on Face Height.
Head Height                         129                 121.0
  Remarks. Possibly Glück measured from centre of auricular passage; Tildesley
  measured from the “tragion.”
Mandibular Breadth                  103                 108.7
  Remarks. Must be due to personal equation in determining the jaw-angle.
External Ocular Distance             92.6                87.5
  Remarks. Glück’s measurement, like Martin’s, is from the “Augenwinkel,” but
  was it really taken from orbital margin?
Hand Breadth                         89                  84.9
Foot Breadth                        106                 102.9
  Remarks. Differences due to personal equation in flattening hand and foot.
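
The odds of 4095 to 1 quoted in the remarks to the comparable characters can be reproduced as follows. This is a minimal sketch, assuming (as the remark itself does) that the twelve characters are treated as independent and that each is equally likely to be greater in either series; the function name is ours and purely illustrative.

```python
from fractions import Fraction


def odds_against_all_same_sign(n_characters: int) -> Fraction:
    """Odds against all n independent differences falling in one stated direction.

    Assumes each difference is equally likely to be positive or negative,
    and ignores the correlations between characters.
    """
    p_all_greater = Fraction(1, 2) ** n_characters
    return (1 - p_all_greater) / p_all_greater


# Twelve comparable absolute characters, every one greater in one group.
print(odds_against_all_same_sign(12))  # 4095, i.e. odds of 4095 to 1
```

Since the characters are in fact moderately correlated, the true odds would be somewhat less extreme than this figure.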



It will be seen that while Miss Tildesley’s values are more in accordance with 
Glück’s than with Zampa’s, the divergence between them points to a racial change 
north of the present Albanian border, or else to extreme diversity in methods of 
measurement, which undoubtedly occurs for certain characters. 

We may consider here, somewhat out of order, a paper by Eugène Pittard of 
1922*, since it deals only with the Cephalic Index. He gives this index for 116 men 
from different districts in Albania, from the environs of Scutari in the north to those 
of Argirocastro (Guinokastra) in the extreme south, “ainsi que des localités 
appartenant à la partie centrale de l’Albanie.” It is not surprising that Pittard 
found a value, 87.9, for the first Cephalic Index almost the weighted mean of Miss 
Tildesley’s North and South Groups, i.e. 87.5. It is not possible, however, to pool 
the north, central and south Albanians in this manner, and we do not understand 
the basis of Pittard’s remark: 

Une telle homogénéité...est à souligner fortement, car on n’en trouverait guère de semblable 
dans la Péninsule des Balkans (p. 50). 
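
The weighted mean of 87.5 referred to above may be checked with a short calculation. The sketch below assumes group sizes of 85 (Southern) and 77 (Northern), the numbers of men in Miss Tildesley's two groups; the helper name is illustrative only.

```python
def weighted_mean(values, weights):
    """Mean of the group values, each weighted by its number of individuals."""
    total_weight = sum(weights)
    return sum(v * w for v, w in zip(values, weights)) / total_weight


# First Cephalic Index: Southern Group 90.8, Northern Group 83.9.
# Group sizes (85 and 77) are assumed from the numbers of men measured.
pooled_index = weighted_mean([90.8, 83.9], [85, 77])
print(round(pooled_index, 1))  # 87.5
```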

Another paper by Pittard† deals with 26 ♂ crania‡ from two churches in the 
south of Albania at Argirocastro (Guinokastra), and Moscopole (? = Muskopolie = 
Voskopoie, see map, p. 24), and these should accordingly belong to the Toskes, or 
Southern Albanians. For these 26 skulls the Length is 170.4 and the Breadth 148.2, 
giving a Cephalic Index of 86.7. If we assume some 7 mm. allowance for flesh and hair 
on the Maximum Cranial Length and 12 mm. on the Parietal Breadth, these would 
give a Cephalic Index on the living of about 90.3 in reasonable accord with Miss 
Tildesley’s value of 90.8. Pittard’s seven El Bassan male crania give a Cephalic Index 
of 84.8, already as a cranial value in excess of Miss Tildesley’s 83.9 for the living 
head in the North Group. El Bassan is just north of the Skumbi, and it seems 
highly probable that Central Albania would show an index intermediate between 
Miss Tildesley’s two groups. 
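
The conversion from the cranial to the living Cephalic Index used in this paragraph can be sketched as below. The allowances (7 mm. on length, 12 mm. on breadth) are those assumed in the text; the function and its parameter names are ours and are not taken from any of the authors cited.

```python
def living_cephalic_index(skull_length_mm, skull_breadth_mm,
                          length_allowance_mm=7.0, breadth_allowance_mm=12.0):
    """Cephalic Index (100 * breadth / length) after adding soft-tissue allowances."""
    living_length = skull_length_mm + length_allowance_mm
    living_breadth = skull_breadth_mm + breadth_allowance_mm
    return 100.0 * living_breadth / living_length


# Mean length and breadth of Pittard's 26 southern male crania.
print(round(living_cephalic_index(170.4, 148.2), 1))  # 90.3
```

The result depends directly on the allowances chosen, which is why the agreement with the value on the living can only be called reasonable.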

A paper by Max Kassbacher§ deals with five Albanian skulls in the Anatomical 
Institute of Heidelberg University, one of the skulls being that of a juvenile. Very 
ample measurements are taken, namely 65 absolute measurements and 83 indices 
and angles! No means are given (indeed they would be of little value on four 
individuals), and the author concludes, on the basis of a graphical method due to 
Toldt, that the skulls belong to the so-called “Dinarischen Rassen.” No locus is given 
for the source of the skulls, and the writer seems ignorant of Pittard’s paper of 1924 
dealing with much larger numbers. The question of sex does not appear to be 
considered. By a graphical method due to Mollison, it is asserted that the skulls 

* “L’Indice céphalique chez 116 Albanais,” Revue anthropologique, XXXIIième année (1922), 
pp. 48–51. 

† “Contribution à l’étude anthropologique de l’Albanie. L’Indice céphalique de 58 crânes 
d’Albanais,” Institut international d’anthropologie, IIième Session Prague, 1924, pp. 220–226 (1926). 

‡ There is one skull from Scutari and seven males from El Bassan. The 24 female skulls do not 
concern us here. 

§ “Metrische und vergleichende Untersuchung an Albanerschädeln,” Zeitschrift für Anatomie und 
Entwicklungsgeschichte, Bd. 90 (1929), S. 19[?]–221. 





form “eine einheitliche Gruppe” (S.210). The Cephalic Indices of the five skulls 
are given as 78.9, 81.4, 79.1, 79.7, 81.2; these values contrast strangely with Pittard’s 
86.8 for 34 ♂ and 87.8 for 24 ♀ skulls for the southern portion of Albania. The 
Heidelberg crania may have been brought from the north, in which case they 
would confirm the view that whatever these skulls may be the Albanians themselves 
are not “eine einheitliche Gruppe.” The paper, notwithstanding its extensive array 
of figures, is of small service for our present purpose, not only on account of the 
smallness of the series, but also because no locus of origin is supplied. 

There is still another contribution by Eugène Pittard from 1920*. In this he 
deals with a large variety of measurements taken on some 112 men from different 
parts of Albania. He says that of this number there were 27 Ghègues and 51 Toskes, 
defining the Albanians from the north of the river Skumbi as Ghègues and those 
from the south of it as Toskes. It is not clear who the remaining 34 Albanians were: 
they must have come from either north or south of the Skumbi, but they may be 
hybrids. Clearly Tildesley’s North and South Groups, which come from relatively 
small areas in the extreme north and south, do not correspond with Pittard’s Ghègues 
and Toskes. Thus we have: 

                  Tildesley’s    Pittard’s    Tildesley’s    Pittard’s
                  Southerners    Toskes       Northerners    Ghègues
Stature           1637 (85)      1673 (48)    1690 (77)      1683 (27)
Cephalic Index    90.8 (84)      87.0 (51)    83.9 (77)      84.7 (27)

The differences are in the same direction, but they cannot be supposed to arise 
from two pairs of samples of the same two populations. 

Meanwhile Pittard’s combined Albanian results should not show marked 
differences from Miss Tildesley’s results in those characters for which her Northern 
and Southern Groups show no marked racial distinctions. 

Nothing is said in the volume by Pittard as to definitions of measurements, 
but there is little doubt that Pittard followed Broca. Now the fact that for a 
considerable number of characters there is almost complete accordance between 
Pittard’s combined Albanians and Tildesley’s— i.e. for Sitting Height, Span, 
Maximum Head Length, Auricular Height, 2nd and 3rd Cephalic Indices (less 
than a unit), and Bi-zygomatic Breadth — suggests that we are dealing with the 
same mixed population, possibly not mixed in quite the same proportions, but the 
striking difference between the values of other characters compels us almost of 
necessity to believe that the two observers are not defining their quantities in the 
same manner or that the personal equation, to which measurements on the living 
are subject, is too great to admit of racial conclusions being safely drawn. 

We take first the nose measurements. Miss Tildesley gives absolutely the same 
Nasal Height for both her groups and very nearly the same Nasal Breadth. But it 
is clear that Pittard is measuring his Nasal Height, and probably his Nasal Breadth, 
in quite a different manner. The result is that judged by Nasal Index we should 
conclude on this character alone that we are dealing with two very distinct races. 

* Les Peuples des Balkans, Geneva (1920) (the full title is given in the footnote, p. 32); see Pittard’s 
pp. 278–291. 





TABLE II. 

                                Pittard’s     Tildesley’s
                                Combined      Southern    Northern
                                Albanians     Group       Group       Combined
Character                       Average       Average     Average
                       No.:     112           85          77          162

Stature                         1678          1637        1690        1662
Sitting Height                   886.9         879.4       891.9       885.3
Span                            1718          1693        1756        1723
Maximum Head Length (L)          181.3         177.0       186.5       181.5
Maximum Head Breadth (B)         156.0         160.7       156.5       158.7
Auricular Height (H)*            121.4         121.7       121.0       121.4
Minimum Frontal Diameter         111.1         109.3       108.7       109.0
1st Cephalic Index (B/L)          86.4          90.8        83.9        87.5
2nd Cephalic Index (H/L)          66.8          68.8        64.9        66.9
3rd Cephalic Index (H/B)          77.1          75.7        77.3        76.5
Height of Nose                    51.35         56.4        56.4        56.4
Breadth of Nose                   35.3          34.4        34.1        34.3
Nasal Index                       68.8          61.0        60.5        60.8
Bi-zygomatic Breadth             140.7         141.0       140.7       140.9
Ear Length                        63.75         61.6        61.8        61.7
Ear Breadth                       36.1          36.9        37.1        37.0
Aural Index                       56.6          59.8        60.1        59.9
External Ocular Distance          96.6          87.8        87.5        87.7
Internal Ocular Distance          30.7          32.4        32.6        32.5


* “Diamètre auriculo-bregmatique.” Pittard does not state how the bregma is to be found on the 
living, but his value of H agrees closely with those of Miss Tildesley. 

If the two observers are measuring from different points†, then the urgency of some 
standardisation in definition and measurement becomes obvious. Much the same 
remarks apply to the aural measurements, where we reach aural indices whose 
differences we may be fairly sure are personal not racial. Another like case is that 
of the ocular distances, where Miss Tildesley appears consistent with herself, but 
differs widely from Pittard. It is quite possible that in Maximum Head Breadth 
and even in Stature (where a good deal of adjustment is needful) personal equation 
is playing its part. Minimum Frontal Breadth has so little variation that one is 
inclined to believe that even the two millimetres difference may be personal equation 
depending on the pressure of the calipers ! 

Another paper which deserves consideration in our present inquiry is that 
of Haberlandt and Lebzelter§. This paper has a considerable advantage over some 

† Martin’s External Ocular Distance (16) is to the external canthi and Broca’s to the external 
margins of the orbits. This may account for the difference, but only the more emphasises the need for 
standardisation. 

‡ Miss Tildesley, following her instructors, may be assumed to be seeking the nasal suture, which 
Martin considered identifiable in the living. Pittard is probably following Broca who measures from 
the “racine du nez” to the “point sous-nasal” (Instructions générales, 1879, p. 182). On pp. 139–140 
Broca says that the “racine du nez” or “nasion” corresponds on the skeleton to the nasal suture. 

§ “Zur physischen Anthropologie der Albanesen,” Archiv für Anthropologie, Bd. XLV (N.F. XVII) 
(1918), S. 123–154. 






others in that it deals with Albanian soldiers and states very definitely from what 
parts of Albania they were drawn. The districts are all in the northern section, 
i.e. north of the Skumbi, and mostly in the extreme north, covering a good deal of 
Miss Tildesley’s Northern Group area. Two localities ought, however, to be excepted. 
If we may judge by Haberlandt and Lebzelter’s data for 13 men only, then in the 
district of Kastrati there exists a very tall local race (1738.5 mm.) with extreme 
brachycephaly (88.2). This district is to the west and to the north-west of Miss 
Tildesley’s area. In brachycephaly, but not in stature, it approaches Miss Tildesley’s 
Southern Group. The second group of Haberlandt and Lebzelter’s measurements 
which ought to be excepted when we compare their data with hers is that of 22 men 
from Kruja. This district is more than half-way down to the Skumbi, and in their 
Cephalic Index, 89.8, we are approaching that of Miss Tildesley’s extreme southern 
area, i.e. 90.8. The mean Stature of this group is the lowest of all in Haberlandt 
and Lebzelter’s districts, i.e. 1680.2, but it is very sensibly higher than Miss 
Tildesley’s 1637.2 for the extreme south. Still these authors agree with her general 
result that stature decreases and brachycephaly increases as we pass from North 
to South Albania. The anomalous position of Kastrati requires further investigation*. 
Pooling all our authors’ data, with the two exceptions named above, we have observa- 
tions which can at any rate be compared with Tildesley’s Northern Group. The 
data for Miss Tildesley’s Northern Group are given in the centre of the Table below 
alongside Haberlandt and Lebzelter’s results. On the extreme left is given Miss 
Tildesley’s Southern Group and on the extreme right the Kastrati and Kruja 
groups of the former observers are combined. These two outer columns are not to 
be compared with each other. Further there is no justification for our adding an 
extreme northern group like Kastrati to the midland Kruja group. Our object is 
as follows: We have seen that while the stature and head-shape of the Albanians 
change widely from north to south, yet scarcely any, possibly no, change takes 
place in the facial characters. This fact provides us with a means of determining 
whether different observers professing to use the same scheme of measurements 
(here Martin’s directions†) reach really comparable results, or whether we must 

* It is the high head breadth, 161.1, of the Kastrati males which deserves special consideration. It 
is even greater than Miss Tildesley’s 160.7 for her Albanians of the extreme south. 

† Haberlandt and Lebzelter do not directly state that they followed Martin; they give no definitions 
of the characters they have measured. But they do say that they used R. Martin’s “Messblätter” and 
in part R. Pöch’s (for prisoners of war). Martin’s “Messblatt” refers to the numbers in his Handbuch. 
A description of Pöch’s “Beobachtungsblatt” for prisoners of war is given by him in the Mittheilungen 
der anthropologischen Gesellschaft in Wien, Bd. XLVI (1916), S. 116–128. He says (S. 128): 

“Die 26 Masse dieses Messblattes sind alle in dem somatologischen Messblatte von R. Martin.... 
Die Numerierung sowie die Benennung der einzelnen Masse ist jedoch genau dieselbe wie im 
Martinschen Beobachtungsblatte, so dass sich jedermann über die bei den einzelnen Messungen gehandhabte 
Technik nach der Nummer, welche das Mass führt, im Martinschen Lehrbuche unterrichten kann.” 

It is clear from this that Haberlandt and Lebzelter are using Martin’s technique, precisely as we may 
suppose Miss Tildesley’s instructors to be doing. Personally the present writer does not wonder that 
they could in a number of measurements reach different results. The definitions and methods of 
measurement provided by Martin are in some cases so obscurely stated that one is not surprised at 
the amount of personal equation which flows from their use, and one wonders if Martin himself had 
ever applied them to long series. 



infer that their personal interpretations lead to character-differences which are 
certainly of the order of racial differences. 
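
The combined Kastrati and Kruja values in the right-hand column of the Table can be recovered by weighting each district by its number of men. The sketch below assumes the sizes stated in the text (13 men from Kastrati, 22 from Kruja) and the district means quoted above; the function name is illustrative.

```python
def pooled_mean(means, sizes):
    """Pool district means into one mean, weighting by the number of men."""
    return sum(m * n for m, n in zip(means, sizes)) / sum(sizes)


sizes = [13, 22]  # Kastrati, Kruja (numbers of men as given in the text)

pooled_stature = pooled_mean([1738.5, 1680.2], sizes)
pooled_cephalic_index = pooled_mean([88.2, 89.8], sizes)

print(round(pooled_stature))            # 1702
print(round(pooled_cephalic_index, 1))  # 89.2
```

Pooling of this kind is arithmetic only; as the text insists, it gives no justification for treating the two districts as one racial group.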

Comparing in Table III the two central columns for Northern Groups we see that 
as far as the following characters are concerned : Stature, Head Breadth, Bi-zygomatic 
Breadth, Minimum Frontal Breadth, Mandibular Breadth, Nasal Breadth, and possibly 
Internal Ocular Distance, no distinction between the two sets of measurements of the 
Northern Albanians can be made. When, however, we turn to Face Height with the 
corresponding Face Index, to Nasal Height and Depth with the corresponding two 
Nasal Indices, and possibly to Head Length and the corresponding Cephalic Index, 
we mark divergences which in the case of cranial measurements would be said to mark 
racial differences. Do they do so in this case, or are they due to different interpretations 
of definitions, to the use of different definitions, or to personal equation in 
measurement? We think a partial answer can be given to this question. Take first the Face 
Height ; Tildesley gives reasonably like Face Heights for her Northern and Southern 


TABLE III. 

Comparison of Tildesley’s with Haberlandt and Lebzelter’s results for 
Northern Albania. 

(Measurements in mm.) 

                              Tildesley’s              Haberlandt and Lebzelter’s
                              Southern    Northern     Northern    Kastrati and
                              Group       Group        Group†      Kruja combined
Character            No.:     (83–85)     (76–77)      (83–84)     (33–35)

Stature                       1637        1690         1687        1702
Head Length                    177.0       186.5        184.4       179.8
Head Breadth                   160.7*      156.5        156.7       160.4
Cephalic Index                  90.8        83.9         85.1        89.2
Face Height                    119.0       120.5        116.7       116.4
Bi-zygomatic Breadth           141.0       140.7        141.0       142.8
Face Index                     {84.4}      {85.6}        83.1        81.8
Minimum Frontal Breadth        109.3       108.7        107.4       108.4
Mandibular Breadth             106.9       108.7       [107.5]‡    [107.9]‡
Fronto-Mandibular Index       {102.2}     {100.0}        99.9       100.5
Nasal Height                    56.4        56.4         51.9        51.9
Nasal Breadth                   34.4        34.1         33.8        34.8
Nasal Depth                     17.1        17.1        [22.7]‡     [23.2]‡
Nasal B/H Index                {61.0}      {60.5}        65.2        67.2
Nasal D/B Index                {49.6}      {50.1}        67.1        66.6
Internal Ocular Distance        32.4        32.6         31.9        33.8

Results in curled brackets are the indices obtained from the ratio of means of 
absolute characters. 

* Omitting one breadth of 105 (!). 

† Omitting the “Serben und Türken aus Podgorica” group. 

‡ Absolute measurements not provided, only the Fronto-Mandibular and Nasal Depth/Breadth 
Indices recorded. The absolute values are supplied from the indices. 






Groups, and Haberlandt and Lebzelter do the same for their two groups. Both 
observers are self-consistent but differ extremely the one from the other. There 
cannot be a doubt that we are not here dealing with a racial difference but with a 
matter of definition or interpretation of definition. Turn again to the Nasal Height, 
Tildesley’s results are the same for the two groups and so are Haberlandt and 
Lebzelter’s. Both agree in the statement that the Nasal Height does not vary from 
one group to a second. But these observers differ so substantially in the absolute 
height of the nose, and the resulting Nasal Index (breadth/height) that a racial 
differentiation would be legitimately asserted, had we not the absolute equality 
within their own measurements of groups which are racially distinct in other 
characters; the like remark applies equally to their measurements of Nasal Depth 
which are self-consistent within the same observers’ data and inconsistent between 
observers. 
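
The two derived kinds of entry in Table III, the indices in curled brackets and the absolute values in square brackets, can be illustrated with a short sketch. It assumes that each index is 100 times the ratio named (Face Index as face height over bi-zygomatic breadth, Fronto-Mandibular Index as frontal over mandibular breadth), which is inferred from the arithmetic of the Table rather than stated by the authors; the function names are ours.

```python
def index_from_means(numerator_mm, denominator_mm):
    """Index obtained from the ratio of two mean absolute measurements."""
    return round(100.0 * numerator_mm / denominator_mm, 1)


def numerator_from_index(index_value, denominator_mm):
    """Recover the numerator measurement from a recorded index."""
    return round(index_value * denominator_mm / 100.0, 1)


def denominator_from_index(index_value, numerator_mm):
    """Recover the denominator measurement from a recorded index."""
    return round(100.0 * numerator_mm / index_value, 1)


# Curled-bracket indices, Tildesley's Southern Group.
print(index_from_means(119.0, 141.0))  # Face Index: 84.4
print(index_from_means(34.4, 56.4))    # Nasal B/H Index: 61.0

# Square-bracket values, Haberlandt and Lebzelter's Northern Group.
print(numerator_from_index(67.1, 33.8))     # Nasal Depth: 22.7
print(denominator_from_index(99.9, 107.4))  # Mandibular Breadth: 107.5
```

An index formed from the ratio of means need not equal the mean of individual indices, so small discrepancies between the two kinds of entry are to be expected.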

We are inclined to think the same criticism may be applied if in a less degree to 
Head Length. Tildesley gives a Head Length which is greater than that of any one of 
the six districts or for the pooled data of Haberlandt and Lebzelter, thus she obtains 
a lower Cephalic Index than they do. Picking out the Austrian investigators’ 
groups which correspond geographically most nearly to Tildesley's northern area, the 
Cephalic Index obtained is 85.1, as compared with Tildesley’s 83.9, and the difference 
lies entirely in the measurement of the Head Lengths, for the Head Breadths in both 
cases are practically identical (156.6). Whether this difference in Head Length is due 
to the choice of the frontal terminus, to choice of the occipital terminus, to a difference 
of dealing with the hair on the back of the head, or to the use of different types of 
head spanners, or to conventions as to the maximum length in asymmetrical heads, 
we cannot say. All we can draw attention to is that in measurements on the living 
methods of procedure and personal interpretation may lead even in such important 
racial characters as the Cephalic Index or the Nasal Index to divergences as great 
as those which in cranial research mark definite racial differences. 

With these facts before us, sad as the conclusion is, we are compelled to hold 
that the great and ever accumulating mass of measurements on the living are 
practically worthless. Observers may state that they are following the directions 
of such and such an authority, but this is of no avail. Obscurity always arises about 
verbally defined “ points,” which are not points. The only solution is training under 
instructors whose methods have been standardised one with the other. And this 
standardisation involves not only that of human instruments, but of the machines 
employed. How many observers think it needful to test their scales and calipers 
against standard scales before they start, and how many remember to repeat the same 
on their return? The present conception seems to be that a very large number of 
measurements vaguely made on relatively few individuals by wholly unstandardised 
observers will be of value in building up the racial history of mankind. This is the 
veriest delusion. Sad as the statement may seem, the work of measuring the living 
will have to be restarted with highly trained observers, standardised internationally 
(or better supernationally) one with another. As the astronomers started 
internationally to divide up the heavens for their great star-chart, so anthropologists 
when they have ceased to be dilettanti will divide up the world, working like the 
astronomers in a standardised way with standardised instruments, to reach an 
international racial chart. Important as the choice of measurements on the human 
body may be, that choice and the mode of definition is of far less value than personal 
and instrumental standardisation. Sadly, but without hesitation, we affirm that what 
has been done will have sooner or later to be scrapped, and anthropometricians must 
start afresh, not even from internationally accepted definitions — which do not define — 
but on a basis of international standardisation. Even then we scarcely believe that 
measurements on the living body will give as satisfactory results as measurements 
on the skeleton. But at the same time variations in external bodily type, if we can 
measure them— especially in physiognomy — are of the greatest interest and value; 
they are not necessarily in every case highly correlated with the skeletal framework 
beneath them. Perhaps, as in the case of astronomy, it is not to direct measurement 
that we shall trust in the future, but to measurement on standardised photographs. 
Some attempt has been made in this direction by the type silhouettes of the 
Biometric Laboratory. These are only on trial at present, but if they are successful 
the same method might be applied to other parts or aspects of the human body. 
The rapidity with which a hundred standardised photographs could be obtained in 
the field as compared with 30 or 40 measurements on 100 men is a great advantage, 
and it leaves the laborious task of measurement to be undertaken in the laboratory 
with proper instruments under standardised conditions. As a slight contribution to 
this topic the type silhouettes of the 67 Northern and 67 Southern Albanians will
be discussed in the remaining section of this paper*. 


(3) SILHOUETTES OF THE ALBANIANS OF THE
NORTH AND SOUTH. 

By the Staff of the Biometric Laboratory. 

The accompanying silhouettes are based on the photographs taken by Miss 
Tildesley during her stay in Albania, and have been dealt with in the customary 
manner. This consists briefly in the most delicate process of enlarging the profile 
photograph by aid of a Coradi pantograph, this being done in two stages to reduce
the error of enlargement. The next process is to modify this enlargement, also by 
aid of the pantograph, so that the distance from mesoporion to the nearest point of 
the nasal bridge coincides with the like distance measured with the ear-plug 
spanner (i.e. the spanner used to take the auricular height) on the subject of the 
photograph. We have now a life-size silhouette of this subject. By aid of somewhat 
elaborate co-ordinate systems, a very large number of measurements are taken on 

* A certain number of photographs could not be used because a portion of the head was not on the 
plate, or because the tragion or sub-orbital point was not visible on the plate, or because the median
sagittal plane of the subject was not parallel to the focal plane of the camera, or for other defects in
photographing. 




Biometrika, Vol. XXV, Parts I and II

Tildesley, The Albanians of the North and South

Plate I. Type Silhouette of the Northern Albanian Group.


Plate II. Type Silhouette of the Southern Albanian Group.





this profile outline. This process is repeated on every one of the individual photographs,
which are to be pooled to obtain the composite, and then the average of
each of the co-ordinates thus obtained is taken, and these are plotted afresh to give 
the average or type contour. Diagrams I and II give the type contours of Miss 
Tildesley’s Northern and Southern Groups respectively, with the mean values of the 



co-ordinates marked upon them. They have been reduced accurately to half scale. 
To obtain the line for orientation a small white wafer is placed on the sub-orbital 
point, and this appears on the photograph. From this with the centre of the
auricular passage, it is possible by a slight adjustment to obtain very approximately 
a line representing the Frankfurt Horizontal. 
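
The averaging step described above may be sketched as follows. This is a minimal illustration only: the station labels and the figures are hypothetical and are not taken from the Laboratory's records, and the actual co-ordinate system of Diagrams I and II is considerably more elaborate.

```python
from statistics import mean


def type_contour(profiles):
    """Average each co-ordinate over the individual life-size profiles.

    `profiles` is a list of dicts, each mapping a station label (hypothetical
    names here) to a measured co-ordinate in mm. Every profile is assumed to
    carry the same set of labels.
    """
    labels = profiles[0].keys()
    return {label: mean(p[label] for p in profiles) for label in labels}


# Illustrative values only, not data from the paper.
profiles = [
    {"y_at_x10": 92.0, "y_at_x20": 101.0},
    {"y_at_x10": 96.0, "y_at_x20": 105.0},
    {"y_at_x10": 94.0, "y_at_x20": 103.0},
]
contour = type_contour(profiles)
print(contour)
```

The mean co-ordinates so found are then plotted afresh to give the type contour.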




Although the suggestion that she should take standard photographs of the
Albanians was made to Miss Tildesley from the Biometric Laboratory and a camera 
with a telephoto-lens was lent to her by one of the Staff, she failed to realise the 
importance of a portion of the instructions given to her. Unfortunately not one of 
the facial or cephalic measurements taken by Miss Tildesley is of any real service 
for the production of standard type silhouettes. If a photograph be taken in profile, 



no major length that could be of real service for determining absolute size must 
have a terminal beneath the hair. Thus maximum head length and auricular 
height are of little value. Facial measurements in the median plane are too small,
too vaguely determined, or too arbitrary to be used. Thus in dealing with the nose
height, if it be measured from the nasion(l) to the subnasal point, the length is not 
only too small, but both terminals are too vague, and the same remark applies to the





nasal depth. The facial heights from nasion to chin, or from hair line to chin would
perhaps be satisfactory as to length, but the determination of the two terminals,
however clean cut on the enlarged photograph, appears to have been hopeless on
the living. That is to say, the scales of enlargement of the photograph to bring it
up to life-size as judged by the several facial measurements were not only quite 
different for the individual, but for the average of the series. No other method 
was adopted by the investigator for standardising her photographs*. She did, 
however, mark on the subject’s face the sub-orbital point and the tragion, and these 
are visible on the majority of the photographs. Unfortunately she did not measure, 
as she might have done, by aid of a projection spanner†, the distance between the
sub-orbital point and the tragion projected on to the median sagittal plane‡.

The method adopted by the Biometric Laboratory is to insert an ear-plug about 
an inch in length and 8 mm. in diameter in the auricular passage; the plug is held
in position against any resistance of the ear-lobe by a simple arrangement. The 
visible end is coated black, with a central white spot, which is reproduced by the 
photograph. On the photograph, after mechanical enlargement, the distance from 
this white spot to the nearest point of the nasal bridge, the hyperrhinion (no 
searching for nasion !) can be at once measured. But the actual distance from the 
central axis of the auricular passages to the nearest point of the nasal bridge can 
be at once measured by the head spanner which takes the auricular height, and is 
inserted in the auricular passages by aid of its ear-plugs. Thus the actual life-size 
of the individual of which we have the profile photograph is determined. The 
knowledge of the central auricular point and of the orbital point on the photograph 
enables us by a slight correction to obtain a good approximation to the Frankfurt 
Horizontal§.

Of course there are other methods of obtaining the scale of enlargement needful 
for the photographs. Thus: 

(i) We may place a scale in the median sagittal plane of the subject's head. 
This is easy enough with the instrumental fittings in the laboratory, but far less 
easy in the field, and does not free us from the need of determining from the 
mesoporion‖ the standard horizontal plane.

(ii) We may place our subjects in such a position that their median sagittal 
planes are always at the same distance from the focal plane of the camera. This is 
an adjustment relatively easy in the laboratory and the photographic room, but by
no means easy in the field, where the ground may be rough and the subject’s seat
extemporised. 

* A critical examination of the photographs shows that they were not taken in the same standard
manner. The chair and the camera were not in the same positions for all individuals.

† See Biometrika, Vol. r. p.

‡ The tragion is, however, in itself a most unsatisfactory point. See Biometrika, Vol. xxB, p. 393.

§ See Biometrika, Vol. xxB, p. 389.

‖ Ibid. p. 389.




It may simplify the work to have all the photographs to standard size, but it is 
risky to attempt this and then fail in that standardisation, for all is then lost. It is 
therefore better to have a knowledge of size from a standard length on subject 
and on photograph. We have not been able to discover anything as good as, still 
less better than, the distance from the hyperrhinion to the central auricular axis. 

To obtain any type contours from Miss Tildesley’s photographs we had to 
follow a roundabout and by no means thoroughly safe road ! After a long period 
of perplexity we determined to proceed as follows: 

There were 134 suitable right profiles available, 67 of each group. It was clear 
as already stated that the scale of reduction was not the same for all these photo- 
graphs. The sub-orbital point and a point anterior to the ear termed by Miss 
Tildesley the “tragion” had been marked on the subject by black dots, and these 
made it feasible to obtain an approximation to the Frankfurt Horizontal plane on 
each profile. 

In the first paper on type silhouettes published in Biometrika in 1928 *, it was 
pointed out in discussing the Pöch-Weninger photographs of the West African
Negro, that Martin's definition of the tragion was so obscure that it was not possible 
to use it in practice. Miss Tildesley kindly furnished the following definition of the 
ear-point termed the tragion as identified by her instructors in Munich:

“In my practice the tragion was determined as the point of intersection of two 
tangents; the one a common tangent to the anterior margin of the tragus and 
crus helicis, the other a tangent to the upper border of the tragus passing through the
nasion as seen in profile.”

While this emended definition still leaves room for personal equation, and 
possibly is inexact, as it is difficult to draw in space tangents to curves only seen in 
profile, and difficult to see a subject in profile except on the focal plane of a 
camera, it is safe to assume that the “tragion” as marked by Miss Tildesley is always
anterior to the opening of the auricular passage, and probably slightly above the
highest point, the hyperporion on the superior margin of that opening. The point
where the axis of the ear-plug inserted into the auricular passage cuts the median
sagittal plane is the mesoporion†, and the auricular point A is defined to be
the one where a circle of 8 mm. radius having the mesoporion as centre is
met by its upper tangent from the projection of the sub-orbital point on to the
median sagittal plane. The tragion as marked by Miss Tildesley probably lies close
to this auricular point A, but anterior to it and slightly inferior to it. It was
accepted as the auricular point A, which serves as the origin of the axes from which
the co-ordinates are measured. 

Now the actual origin of co-ordinates does not matter as far as the silhouettes 
are concerned, if we obtain for each subject a point which may be said to be 
anatomically the same. But it is of importance to have the same point on all type 

* Biometrika, Vol. xxB, pp. 389–400.
† Defined Biometrika, Vol. xxB, p. 389.





contours, if we wish to compare their measurements one with another. Unfortunately
Miss Tildesley, having deserted the hyperrhinion and mesoporion (leading to the 
auricular point) for the nasion and her tragion, did not provide by aid of the 
projection calipers the projected distance between her points. She did, indeed, 
record head measurements of her Albanian soldiers, but as we have seen there is 
no one of these which corresponds with sufficient accuracy to any length which 
can be measured on the photographs to make it possible to determine the individual 
scales of reduction. 

The steps adopted were as follows : The first step was to trace from the photo- 
graphs with extreme care the outline of each head, introducing the marked 
positions of the sub-orbital point and the tragion. These drawings were then
enlarged by two stages to exactly four times their linear dimensions by aid of a 
Coradi precision pantograph. The outlines thus obtained, which were evidently 
rather smaller than those of the living head, were then divided up by the system 
of co-ordinates shown in Diagrams I and II, and their measurements were taken. 

The origin, or point A represented by Miss Tildesley’s tragion, was first joined
to the hyperrhinion N, the nearest point to A on the nasal bridge, and thus we 
obtained the base line NAB, one of the co-ordinate axes, the second being the per- 
pendicular to it through A. From these axes the co-ordinates of all points on the 
outline were determined, except those of the mouth and chin. This method of 
measurements, and those used in the case of the facial outline, were very similar 
to those adopted in constructing the English male type silhouette, though slight 
modifications which can be seen by comparing the figures were introduced in order 
to make the best use of the new material*.

It was next necessary to reduce the measurements of the enlarged profiles to 
a uniform scale as well as this could be done under the circumstances. After 
comparing several caliper measurements made on the living head with the corre- 
sponding lengths on the profiles to which they may roughly be supposed to correspond, 
the following method was found to be the least unsatisfactory. Neither of the 
terminals of the facial height as measured by Miss Tildesley can be located at all 
accurately on the profiles; but it is not unreasonable to suppose that their facial 
height will bear an approximately constant ratio to the line joining the hyper- 
rhinion to the progenion on the actual life-sized silhouette. Now we define the 
progenion as the point on the outline of the chin farthest removed from the line 
joining the protion to the stomion. The protion is the most anterior point of the 
nose obtained by drawing a tangent to it perpendicular to the NAB axis and the
stomion is the meet of the lips. We require only to draw a tangent to the chin
parallel to the protion-stomion join†. All this can be done on the outline. By
means of this ratio of measured nasion-gnathion length on the living to the 
measured hyperrhinion-progenion lengths on the enlarged photographic profiles, 
it was possible to reduce all the enlarged photographic profiles to a common scale, 

* Cf. Biometrika, Vol. xxB, pp. 390–396.

† Biometrika, Vol. xxB, p. 396.




which would not, however, be necessarily the true actual size. These ratios ranged
from 1.250 to 1.553 for the Northern Group and from 1.274 to 1.567 for the Southern; the
nasion-gnathic length being the greater in every case. All the measurements of
each enlarged profile were then multiplied by its particular ratio and the means of 
these adjusted measurements were found for each co-ordinate. Thus type profiles 
ei and eg were obtained with these means for the Northern and Southern Groups. 
But these type contours will not yet be of life-size. A further adjustment was 
needed, for the contours were clearly too large. Now the fifty English male students
whose silhouettes were used in the construction of the English type had a mean
caliper glabellar occipital length of 194.4 mm. The maximum length from the
glabella to the back of the head on the type silhouette was 211.8. These two
lengths will not necessarily coincide in direction, but their difference was treated
as the same for Albanian and English types. This difference was 17.4 mm., which
would in the first place measure thickness of the hair, in the second pressure of the 
calipers, and lastly be possibly a result of non-coincidence of direction, the hair not 
being equally thick over the back of the head. Thus we obtained a rough process 
for finding the absolute size of the silhouettes. The Albanians being soldiers had 
shorter hair (see photographs) than the peasants as a rule would have and would 
be likely to correspond in this respect more closely with the English undergraduates. 
Measuring the maximum length from the glabella on the 67 Northern Albanians 
of the photographs the mean gave 186.6 mm. If we add 17.4 mm. we have for
maximum length from glabella on silhouette 204.0 mm. For the 67 southern
Albanians the mean length on the living head was 176.6 mm. or the corresponding
silhouette length should be 194.0 mm.

On the enlarged type contours, ei and eg, obtained as described above, the maximum
Head Length for the Northerners was 217.7 and this has to be reduced to give
204.0*; the reducing factor is therefore 0.937. In the same manner for the Southerners
the enlarged type contour gave a maximum Head Length of 207.2 and this needed
to be reduced to 194.0, or the reducing factor was 0.936. The near equality of these
two reduction ratios is satisfactory as the Northerners and Southerners were photo- 
graphed in random order, and it indicates that displacements of camera, chair and 
subject in chair were fairly randomly distributed between both groups. These 
reducing factors were applied to all the measurements on the enlarged type contours, 
and final presumed life-size contours obtained for the two groups. These are repro- 
duced to half-scale in Diagrams I and II, the corresponding silhouettes being given 
on Plates I and II to half life-size. The method by which they have been obtained 
from the photographs is frankly admitted to be a crude one, but it appeared the 
best that could be applied to the defective data available. 
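
The arithmetic of this final reduction can be checked with a short sketch, given here only as an illustration of the figures quoted above. The function and variable names are our own; the hair allowance is taken, as in the text, to be the difference between the English silhouette length and the English caliper length.

```python
# Hair allowance from the English type: silhouette length minus caliper length (mm.).
HAIR_ALLOWANCE_MM = 211.8 - 194.4


def reducing_factor(caliper_length_mm, enlarged_contour_length_mm):
    """Factor bringing an enlarged type contour down to presumed life-size.

    The target silhouette length is the mean caliper head length plus the
    hair allowance; the factor is target length over enlarged contour length.
    """
    target_mm = caliper_length_mm + HAIR_ALLOWANCE_MM
    return target_mm / enlarged_contour_length_mm


north = reducing_factor(186.6, 217.7)  # Northern Group
south = reducing_factor(176.6, 207.2)  # Southern Group
print(round(north, 3), round(south, 3))
```

Both factors agree to three decimal places with the values given in the text, 0.937 and 0.936.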

Having obtained in the above manner the Northern and Southern Albanian 
male silhouettes we puzzled once more to find any quite independent method of 
checking our result. Looking up the literature we discovered Leopold Glück’s

* Actually Diagram I and Plate I have been drawn slightly in excess of the true values; they 
require reducing in the ratio 102 to 103. 





statement (see our p. 35) that the “ Entfernung des Ohrloches von der Nasenwurzel ” 
for the measurements on his 30 Northern Albanians was in the mean 103 mm. Now
“ the distance of the ear-passage from the root of the nose” is a very vague expression.
Glück probably took the expression from Virchow's list in Neumayer's Anleitung zu
wissenschaftlichen Beobachtungen auf Reisen, 1875 (see also Zeitschrift für Ethnologie,
Bd. xvii. 1885, S. 99–102), where the measurement is named in precisely the same
way, but this is not helpful, as Virchow does not define it, nor refer to any instrument 
for taking the measurement*. 

If we suppose the “ root of the nose ” to be the most posterior point of the nasal 
bridge we have still to define the word “ posterior.” We shall not reach precisely 
the same point, if we define posterior in regard to the Frankfurt Plane, or as the 
point nearest to the “ Ohrloch.” Finally if we agree to suppose Glück’s measurement
to correspond very closely to our hyperrhinion to mesoporion distance, we have to
remember that this is not Tildesley’s nasion to tragion length. We must first reduce
Glück’s 103 to our distance from auricular point to hyperrhinion and this will be equal
to $\sqrt{103^2 - 8^2} = 102.7$ nearly. Now the auricular point is not Miss Tildesley’s tragion,
which latter may easily be 1.5 to 2.5 mm. anterior to the auricular point. Hence
we should expect on the basis of Glück’s measurement that the distance of A to N
on silhouettes of the Northern Albanians might lie between 100.2 and 101.2, with
a mid-value of 100.7. On our silhouette it is 100.9 for the Northern Group.
Considering the small number of Glück’s cases, the area, wider than Miss Tildesley’s
northern area, from which he drew them†, and further, the fact that the distance
from mesoporion to hyperrhinion varies from 98.3 to 101.2 as we pass from south
to north of Albania, the above accordance is, perhaps, all we could look for. It, of 
course, depends on our interpretation of the term “ Entfernung des Ohrloches von 
der Nasenwurzel ” being correct. Assuming that is so, our silhouettes are hardly 
likely to be at most more than 1 % to 2 % in error as to their absolute size. Had 
Miss Tildesley simply recorded the measurement we suspect to be that taken by 
Glück‡, and used the ear-plug when photographing, all the laborious computing
work just described would have been saved, and the absolute sizes of the silhouettes 
would have been ascertained without any of the doubts arising from hypotheses such 
as we have been compelled to make. 
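
The check against Glück's figure may be retraced numerically. The sketch below assumes, as the text does, that his 103 mm. is the mesoporion to hyperrhinion distance, that the auricular point lies on the circle of 8 mm. radius about the mesoporion, and that the two lengths may be treated as hypotenuse and side of a right-angled triangle; the variable names are ours.

```python
import math

gluck_mm = 103.0   # Glück's mean, read as mesoporion to hyperrhinion
radius_mm = 8.0    # radius of the circle about the mesoporion defining A

# Distance from auricular point to hyperrhinion, rounded as in the text.
a_to_n_mm = round(math.sqrt(gluck_mm**2 - radius_mm**2), 1)

# The marked tragion is supposed 1.5 to 2.5 mm. anterior to the auricular point.
low_mm = round(a_to_n_mm - 2.5, 1)
high_mm = round(a_to_n_mm - 1.5, 1)
mid_mm = round((low_mm + high_mm) / 2, 1)

silhouette_mm = 100.9  # distance A to N on the Northern type silhouette
print(a_to_n_mm, low_mm, high_mm, mid_mm, low_mm <= silhouette_mm <= high_mm)
```

The silhouette value falls inside the expected range, which is all the agreement claimed above.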

* Schmidt (Anthropologische Methoden, 1888, S. 107) identifies the “Nasenwurzel” with “der
tiefsten Stelle der Eingesattelung zwischen Stirn und Nase,” but if a point on a curve is to be “deepest”
it must be with regard to some chord of the curve and he does not define this chord. In the next
sentence he speaks about the nasal suture and apparently identifies the nasion with this “deepest point
of the nasal bridge.” Martin (Lehrbuch der Anthropologie, 1928, Bd. i. S. 147) very properly says that “von
manchen Autoren wird fälschlich die am tiefsten eingesattelte Stelle der Nase als 'Nasenwurzel'
bezeichnet.” He identifies the “Nasenwurzel” with the Nasion, but he does not say how the “tiefsten
eingesattelte Stelle der Nase,” if required, is to be found. We define it as the point on the median
sagittal section of the nose nearest to the mesoporionic axis. This is easy to find in the case of the type
silhouettes.

† His men came from Novibasar, Ipek, Djakova and Prizren (see map, p. 24), part of the former
Turkish Albania, now outside the northern boundary of present Albania.

‡ It is quite simply taken with Pearson's head spanner, and the mesoporion quite simply found on
the photograph by aid of Pearson’s ear-plug.


If the two silhouettes, i.e. that for the Northern and that for the Southern 
Albanians, be superposed so that hyperrhinion and stomion agree, it will be found 
that the whole of the faces are in almost complete accordance. This facial likeness 
we had already seen must arise from Miss Tildesley’s facial measurements. On the 
other hand the tops and backs of their heads show marked divergence ; the Northern 
head projects beyond the Southern in the neighbourhood of the ophryon. From the 
crinion the hair line* of the former lies increasingly outside that of the latter, and 
at the back of the head between obelion and hystation the difference amounts to 
8 to 9 mm. This excess, if it diminishes somewhat, is maintained right down to the
lophion. If the auricular points be superposed, and the two lines from those points 
to the hyperrhinions be brought into contact we find that the Southern type lies 
entirely inside the Northern right away round from gulion to lophion, the two faces 
having practically parallel contours from progenion to the ophryonic region. 

To emphasise the extreme smallness of the Albanian skull, whether north or 
south, we may take the silhouette type contour of the Englishman from the 
pocket of Biometrika, Vol. xxB, and superpose it on that of the Northern Albanian,
so that the nose contours, which fit well, are in agreement, the Englishman's 
hyperrhinion being slightly above that of the Albanian. The Englishman has 
a slightly longer upper lip, but his chin and forehead retreat, his crinion being 
slightly below and inward of the Albanian’s. Beyond the crinion the English type
is bigger than the Albanian, rising about 4 mm. above it at the apex, and it is 5 
to 6 mm. horizontally outside it between obelion and hystation. This excess is 
maintained, if decreasingly, right down to the lophion. If as previously we make 
the auricular points and the hyperrhinion-auricular point lines to coincide, the 
northern Albanian head lies almost inside the English, except for a slight projection 
of the former in the region of the crinion, and a somewhat more significant projection
between plakion and obelion. The southern Albanian type lies wholly inside
the English male type whether we make the auricular points and the auriculo- 
hyperrhinion lines, or the auricular points and Frankfurt Horizontals to coincide. 
Further evidence if required of the smallness of the Albanian, especially the 
southern, heads may be found by superposing the type silhouette contours of the 
English female head or even the West African Negro head.

The comparative value of the silhouettes of racial types can only be settled 
when many others have been constructed. It will be interesting to compare those 
of the Albanians with silhouettes of other Balkan peoples. Some of these are in 
preparation, and they will, we have reason to hope, be free of the omission which 
detracts so much from the value of the present pair. 

* This term “hair line” is used purposely because the measured auricular height of the Southern type
is actually 0.7 mm. in excess of that of the Northern type. (See Table I.) The vertical height from A on
the Northerners’ silhouette is 8.8 mm. greater than that of the corresponding height on the Southerners’.
An examination of Plates III–V suggests that the Northerners' hair may well account for this difference;
the average thickness of hair at the back of the head of 50 young Englishmen was of the order of
17 mm.



Plate III. Albanians of the North (1st series). (iii) Moslem. (iv) Moslem.





Plate IV. Albanians of the North (2nd series). (iii) Moslem. (iv) Catholic.



Plate V. Albanians of the South. (iii) Moslem. (iv) Moslem.




Two points, which, if noticed before, are at any rate re-emphasised by Miss 
Tildesley’s work, are the following: 

(i) There are at least two differentiated groups (or we might say races) in 
Albania, those of the extreme North and of the extreme South. 

(ii) Both races have from the European standpoint small, and in the case of 
the Southern Group extremely small heads. We might add that it is possible for 
two differentiated groups to have faces closely alike, or we must accept the fact 
that a strong facial resemblance by no means connotes racial identity. Thus 
physiognomic characters do not necessarily provide the best method of dis- 
criminating races. 


DESCRIPTION OF PLATES 

I. Type Silhouette of the Northern Albanian Group. 

II. Type Silhouette of the Southern Albanian Group. 

III. Examples of the Photographs of the Albanians of the North (1st Series). 

IV. Examples of the Photographs of the Albanians of the North (2nd Series). 

V. Examples of the Photographs of the Albanians of the South. 





A COMPARISON OF THE SEMI-INVARIANTS OF THE
DISTRIBUTIONS OF MOMENT AND SEMI-INVARIANT 
ESTIMATES IN SAMPLES FROM AN INFINITE 
POPULATION. 

By JOHN WISHART, M.A., D.Sc., Clare College, Cambridge. 

The appearance of yet another paper on the sampling problem* directs atten- 
tion to the success which has attended of recent years the efforts of workers in this 
field. The general problem considered by one group of workers is the following. 
Let there be given a population, supposed infinite in extent, but subject to this 
having any law of distribution with finite moments. It may be a population of one 
or many variables. The population may be regarded as completely specified by a 
knowledge of all its characteristic parameters, which may be moment coefficients 
or semi-invariants, or expressible in terms of these. For a sample of size n drawn 
at random from this population we may calculate in some manner certain functions 
which are to be regarded as estimates of the population moment coefficients, or 
semi-invariants. The simultaneous distribution in repeated samples of the various
estimates will depend upon that of the parent population, and the problem I wish 
to take up deals with the determination of the moment coefficients, or semi-invariants, 
of this simultaneous distribution. Prior to 1928 certain individual results only had 
been worked out ; in that year two independent papers of great importance appeared. 
R. A. Fisher† showed that if we define as estimates of the population semi-invariants
($\kappa_r$) certain functions ($k_r$) of the sample observations by means of the simple
property that the mean value of $k_r$ for an infinite number of samples is to be $\kappa_r$‡,
then the semi-invariants of the simultaneous distribution of the $k$'s are peculiarly
simple in form, compared to analogous expressions derived in other ways. These
semi-invariants can be derived algebraically, or more simply by following out
certain straightforward combinatorial rules. Fisher’s paper marked a great advance 
in showing the possibility of beginning a systematic tabulation of the required 
formulae, a thing that had not before been possible. In his paper all formulae up 
to the tenth degree were given, together with a number of special interest of the 
twelfth degree. In addition the paper showed how the methods could readily be 
applied to multivariate populations, and a number of the more general formulae 
were given. 

The other paper was by Craig§, who dealt with the simultaneous distribution 

* N. St Georgescu: Biometrika, xxiv. 1932, pp. 65–107.

† R. A. Fisher: Proc. Lond. Math. Soc. (2), 30, 1929, pp. 199–238.

‡ [The reader must bear in mind that the mean value of $k_r$ is not the value more likely than any
other to occur, i.e. it is not the modal value. Ed.]

§ C. C. Craig: Metron, vii. 1928–9, pp. 3–74.





of the sample moment coefficients ($m_r$), as ordinarily defined, but chose to express
this distribution by means of its semi-invariants. His results do not, therefore,
have the same peculiar simplicity of expression that Fisher's have. Craig was able,
by algebraic methods, to deduce quite a number of formulae involving moments
not higher than the fourth. Now in the paper already mentioned St Georgescu
derives precisely the same functions as Craig, i.e. the semi-invariants of the
simultaneous distribution of sample moments. The interest of his paper is in the 
presentation of a different method, for he describes a combinatorial procedure 
rather like that of Fisher; although it is not as simple in its rules, just as the final 
results of Craig and St Georgescu are not so simple and compact as those of Fisher. 
Quite obviously, then, those formulae which are given both by Craig and St 
Georgescu are identical, although the identity is not immediately evident, since 
St Georgescu has expressed his results in terms of $N$, one less than the size of the
sample, and has used a different notation. He has, however, a number of formulae 
which do not appear in Craig's paper, for, confining himself to moments not higher 
than the fourth, St Georgescu gives all, or nearly all*, the formulae up to weight 
11, together with certain only of the formulae of weight 12, and two high order 
results for normal populations only. Of these, the formulae of weight 11 are new 
in the sense that there is nothing to correspond with them in Fisher or Craig, 
although in the case of the former methods have been devised† for deriving in a
fairly simple way new patterns from those of lower order, and thus it would not be
difficult to add to the results already published. 

Now $m_r = \sum_{i=1}^{n} (x_i - \bar{x})^r / n$, and the problem, as first taken up by “Student,”
Tschouprow and Church, dealt with the distribution in random samples of the 
estimates m r , expressing this distribution by means of its moment coefficients, 
which were worked out in terms of the moment coefficients of the parent popula- 
tion. As results in this form are still required by some workers, though not by all, 
it becomes important to see how the Fisher results may as required be transformed. 
There are three stages in the process : the first consists in finding the semi-invariants 
of the distribution of the m r estimates, expressed in terms of the population semi- 
invariants, from those of the k r estimates (i.e. deducing the Craig-St Georgescu 
results from those of Fisher J); the second consists in turning these semi-invariants 
of the required distribution into moments ; then, since the results are still expressed 
in terras of the population semi-invariants, the last stage is to turn these latter 
into moments. The last two stages are a matter of routine algebraic transformation, 
using the known relations between moments and semi-invariants, but the methods 
of the first stage are less obvious, and it is one of the purposes of the present paper 
to describe this transformation, which of course is reciprocal. Later a new result is 
worked out, and it is also shown how many of the terms in certain Fisher formulae 
of high order may be deduced from corresponding terms in formulae of lower degree. 

* He has not given $S(3\,2^4)$.

† R. A. Fisher and J. Wishart: Proc. Lond. Math. Soc. (2), 33, 1931, pp. 195-208.

‡ [Is not this to admit that the Craig-St Georgescu results relieve the worker, who is dealing with
moment coefficients, from at least one stage of his labours? Ed.]





Moments and Semi-Invariants in Sampling 

Formulae which involve sample estimates no higher than the third degree can
be readily transformed from the one notation into the other. Since

$$k_2 = n\,m_2/(n-1) \qquad (1),$$

$$k_3 = n^2 m_3/\{(n-1)(n-2)\} \qquad (2),$$

the only change evidently consists in introducing a constant multiplier. Thus let
us begin with Fisher's formula for $\kappa(2^2\,3)$, the (2 1) product semi-invariant, or
moment about the mean (since up to the third degree semi-invariant and moment
are identical), of the simultaneous distribution of $k_2$ and $k_3$ in repeated samples.
The notation is easy to follow (and has been adopted by St Georgescu), for the
figures in large type relate to the estimates involved, while the exponents give the
nature of the semi-invariant evaluated. The formula is Fisher's no. (8),

$$\kappa(2^2\,3) = \frac{\kappa_7}{n^2} + \frac{16\,\kappa_5\kappa_2}{n(n-1)} + \frac{12(2n-3)\,\kappa_4\kappa_3}{n(n-1)^2} + \frac{48\,\kappa_3\kappa_2^2}{(n-1)^2}.$$

In the notation of St Georgescu, we now obtain $S(2^2\,3)$, i.e. the (2 1) semi-invariant
of the simultaneous distribution of $m_2$ and $m_3$, by multiplying the above result
throughout (see (1) and (2)) by

$$\left(\frac{n-1}{n}\right)^2 \frac{(n-1)(n-2)}{n^2}.$$

We then have

$$S(2^2\,3) = \frac{(n-1)^3(n-2)\,\kappa_7}{n^6} + \frac{16(n-1)^2(n-2)\,\kappa_5\kappa_2}{n^5} + \frac{12(n-1)(n-2)(2n-3)\,\kappa_4\kappa_3}{n^5} + \frac{48(n-1)(n-2)\,\kappa_3\kappa_2^2}{n^4}.$$

This agrees with Craig's result (loc. cit. p. 55, formula for $S_{21}(\nu_2, \nu_3)$), and also
with St Georgescu's if we make the substitution $n = N + 1$.
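
The constant-multiplier step lends itself to a quick numerical check. The following Python sketch is an illustration only: the function names and dictionary keys are ours, and exact rational arithmetic from the standard library stands in for the algebra. It multiplies the four coefficients of Fisher's formula no. (8) by the factor implied by (1) and (2).

```python
from fractions import Fraction


def kappa_22_3_coeffs(n):
    """Coefficients of kappa(2^2 3), keyed by the product of population kappas."""
    n = Fraction(n)
    return {
        "k7": 1 / n**2,
        "k5*k2": 16 / (n * (n - 1)),
        "k4*k3": 12 * (2 * n - 3) / (n * (n - 1) ** 2),
        "k3*k2^2": 48 / (n - 1) ** 2,
    }


def k_to_m_multiplier(n):
    """Factor carrying a (2 1) semi-invariant of (k2, k3) to that of (m2, m3).

    From (1) and (2): m2 = (n-1) k2 / n and m3 = (n-1)(n-2) k3 / n^2.
    """
    n = Fraction(n)
    return ((n - 1) / n) ** 2 * (n - 1) * (n - 2) / n**2


def S_22_3_coeffs(n):
    """Coefficients of S(2^2 3), obtained term by term from kappa(2^2 3)."""
    c = k_to_m_multiplier(n)
    return {term: c * value for term, value in kappa_22_3_coeffs(n).items()}
```

Calling `S_22_3_coeffs(10)` returns the four coefficients as exact fractions, which can be compared with the closed forms printed in the text for any sample size greater than 2.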

It is to be hoped that we shall in time settle down to a uniform and satisfactory
notation for the semi-invariants. What Fisher writes $\kappa_r$ is written $\lambda_r$ by Craig and
$S_r$ by St Georgescu. In both these cases the influence of Thiele is apparent, but to
both there are objections. If we are to extend the practice of having corresponding
Latin and Greek letters for sample estimate and population parameter respectively,
a practice that has much to commend it, then that rules out S straight away as not
being a Greek letter. In any case s is already appropriated for standard deviation.
There is not much to choose between $\lambda_r$ and $\kappa_r$, but on the ground that the Latin
l for sample estimate is less satisfactory than k, liable as it is to be confused with
the numeral one, we would advocate the use of k and $\kappa$ throughout.

When a fourth or higher order estimate comes in, the transformation is less
simple. Fisher, in his original paper (loc. cit. para. 10), gave a general demonstration
of the method to be followed, but it will not be out of place here to give the
details, illustrating by means of an example or two. Let us take the case of $\kappa(4^2)$,
the variance of $k_4$, compared with that of $m_4$, denoted by $S(4^2)$. The latter formula
occupies three and a half lines of print in St Georgescu's paper, and contains
a number of quite complex terms. The former reads

$$\kappa(4^2) = \frac{\kappa_8}{n} + \frac{16\,\kappa_6\kappa_2}{n-1} + \frac{48\,\kappa_5\kappa_3}{n-1} + \frac{34\,\kappa_4^2}{n-1} + \frac{72n\,\kappa_4\kappa_2^2}{(n-1)(n-2)} + \frac{144n\,\kappa_3^2\kappa_2}{(n-1)(n-2)} + \frac{24n(n+1)\,\kappa_2^4}{(n-1)(n-2)(n-3)}.$$



John Wishart 




The number of terms is the same, but the simplicity of each is very marked. In
fact the occurrence of common factors in the second and third, and fifth and sixth,
terms would enable the formula to be abbreviated by taking these terms together,
while the writing of N for n - 1 would also somewhat shorten the formula. It is
of more immediate interest, however, to see how the Craig-St Georgescu result
can be derived from the above. By definition we have

$$k_4 = \frac{n^2(n+1)\,m_4}{(n-1)(n-2)(n-3)} - \frac{3n^2 m_2^2}{(n-2)(n-3)}.$$

Thus we may write $m_4 = p k_4 + q k_2^2$, utilising the value of $k_2$ already given in (1),
where p and q stand for the two factors involving the size of the sample

$$p = (n-1)(n-2)(n-3)/\{n^2(n+1)\}, \qquad q = 3(n-1)^3/\{n^2(n+1)\}.$$
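
The identity $m_4 = p k_4 + q k_2^2$ holds for every sample, so it can be confirmed on any set of numbers. The sketch below is an illustration under our own naming (`central_moment`, `k2_k4` and `p_q` are not from the paper); it uses exact fractions so that the two sides can be compared without rounding.

```python
from fractions import Fraction


def central_moment(xs, r):
    """m_r: the r-th sample moment about the sample mean, with divisor n."""
    n = len(xs)
    mean = Fraction(sum(xs), n)
    return sum((x - mean) ** r for x in xs) / n


def k2_k4(xs):
    """Fisher's k2 and k4 written in terms of m2 and m4 (requires n > 3)."""
    n = Fraction(len(xs))
    m2, m4 = central_moment(xs, 2), central_moment(xs, 4)
    k2 = n * m2 / (n - 1)
    k4 = (n**2 * (n + 1) * m4 / ((n - 1) * (n - 2) * (n - 3))
          - 3 * n**2 * m2**2 / ((n - 2) * (n - 3)))
    return k2, k4


def p_q(n):
    """The two sample-size factors in m4 = p*k4 + q*k2^2."""
    n = Fraction(n)
    p = (n - 1) * (n - 2) * (n - 3) / (n**2 * (n + 1))
    q = 3 * (n - 1) ** 3 / (n**2 * (n + 1))
    return p, q
```

For an integer sample `xs`, `p * k4 + q * k2**2` with `(k2, k4) = k2_k4(xs)` and `(p, q) = p_q(len(xs))` reproduces `central_moment(xs, 4)` exactly.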

We are to consider now the simultaneous distribution of $k_4$ and $k_2$, and to find
the distribution of $m_4$, a certain known function of these quantities. The moment
generator of the distribution is given by the operator

$$\exp\{\tau\,(p\,\partial/\partial t_1 + q\,\partial^2/\partial t_2^2)\},$$

while the operand is

$$1 + \mu(4)\,t_1 + \mu(2)\,t_2 + \mu(4^2)\,\frac{t_1^2}{2!} + \mu(4\,2)\,t_1 t_2 + \mu(2^2)\,\frac{t_2^2}{2!} + \ldots.$$

The operator is expanded and the differentiations carried out, after which $t_1$ and $t_2$
are put equal to zero. We have the following series in $\tau$:

$$1 + \{p\,\mu(4) + q\,\mu(2^2)\}\,\tau + \{p^2\mu(4^2) + 2pq\,\mu(4\,2^2) + q^2\mu(2^4)\}\,\tau^2/2! + \ldots \qquad (4),$$

the binomial character of the coefficient of $\tau^r/r!$ being evident. In general the
notation $\mu(a^b)$ denotes the moment coefficient corresponding to the semi-invariant
$\kappa(a^b)$. The series in $\tau$ is the moment generator of the distribution of $m_4$. To obtain
the semi-invariant generator we expand the logarithm of (4) in powers of $\tau^r/r!$.
We get

$$\{p\,\mu(4) + q\,\mu(2^2)\}\,\tau + \left[p^2\{\mu(4^2) - \mu^2(4)\} + 2pq\,\{\mu(4\,2^2) - \mu(4)\,\mu(2^2)\} + q^2\{\mu(2^4) - \mu^2(2^2)\}\right]\tau^2/2! + \ldots \qquad (5).$$

The term in $\tau^2/2!$ in this is the $\kappa_2$, or variance, of the distribution of $m_4$, i.e. the
required result. But it is expressed in terms of the moment coefficients of the
simultaneous distribution of $k_4$ and $k_2$, whereas it is the semi-invariants which are
known from Fisher's work. The relations connecting moments and semi-invariants
are, however, well known. Those we shall require are

$$\mu_1 = \kappa_1,$$
$$\mu_2 = \kappa_2 + \kappa_1^2,$$
$$\mu_{12} = \kappa_{12} + 2\kappa_{11}\kappa_{01} + \kappa_{02}\kappa_{10} + \kappa_{10}\kappa_{01}^2,$$
$$\mu_4 = \kappa_4 + 4\kappa_3\kappa_1 + 3\kappa_2^2 + 6\kappa_2\kappa_1^2 + \kappa_1^4.$$

The term in $\tau^2/2!$ in (5) may then be written

$$S(4^2) = p^2\kappa(4^2) + 2pq\,\{\kappa(4\,2^2) + 2\kappa(4\,2)\,\kappa_2\} + q^2\{\kappa(2^4) + 4\kappa(2^3)\,\kappa_2 + 2\kappa^2(2^2) + 4\kappa(2^2)\,\kappa_2^2\} \qquad (6).$$




This is the relation sought, into which the known values of p and q may be
inserted. It is seen to involve, in addition to $\kappa(4^2)$, a number of other results, all
tabulated in Fisher's paper. Reciprocally, of course, it is also possible to express
$\kappa(4^2)$ in terms of a series of the Craig-St Georgescu results.

The terms of $S(4^2)$ may now be worked out one by one. Take for example the
term in $\kappa_2^4$, the only term that survives when the parent population is normal. In
this case the entire middle term of (6) (that in pq) vanishes, since $\kappa(a\,2^b)$ is zero
for normal populations when a > 2. We make use of the known results

$$\kappa(4^2) = 24n(n+1)\,\kappa_2^4/\{(n-1)(n-2)(n-3)\} \qquad (7),$$

$$\kappa(2^r) = 2^{r-1}(r-1)!\,\kappa_2^r/(n-1)^{r-1}.$$

Substitution in the above formula leads without difficulty to

$$S(4^2) = 24(n-1)(4n^2 - 9n + 6)\,\kappa_2^4/n^4 \qquad (8),$$

as given by Craig, St Georgescu, and others.
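
The substitution leading to (8) can be reproduced mechanically. The sketch below is ours, not the paper's: it sets $\kappa_2 = 1$, drops the middle term of (6) as the text directs for a normal parent, and evaluates the remaining terms exactly for a chosen n. The function names are hypothetical.

```python
from fractions import Fraction
from math import factorial


def kappa_2r_normal(r, n):
    """Normal-population kappa(2^r) with kappa_2 = 1."""
    n = Fraction(n)
    return 2 ** (r - 1) * factorial(r - 1) / (n - 1) ** (r - 1)


def S_4sq_normal_term(n):
    """Coefficient of kappa_2^4 in S(4^2), assembled from relation (6)."""
    n = Fraction(n)
    p = (n - 1) * (n - 2) * (n - 3) / (n**2 * (n + 1))
    q = 3 * (n - 1) ** 3 / (n**2 * (n + 1))
    kappa_4sq = 24 * n * (n + 1) / ((n - 1) * (n - 2) * (n - 3))  # formula (7)
    bracket = (kappa_2r_normal(4, n) + 4 * kappa_2r_normal(3, n)
               + 2 * kappa_2r_normal(2, n) ** 2 + 4 * kappa_2r_normal(2, n))
    return p**2 * kappa_4sq + q**2 * bracket
```

For any n greater than 3 the value agrees with $24(n-1)(4n^2-9n+6)/n^4$, the coefficient in (8).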


Actually, in this case, it would probably be nearly as simple to obtain the result
by direct algebraic methods, but the example is only an illustration of what is
possible. The series (5) has in fact been extended by the writer to the terms in
$\tau^4/4!$ and used to work out the normal term of $S(4^4)$ in terms of $\kappa(4^4)$ and other
$\kappa$ results, thus checking the result given by Craig and later by St Georgescu. With
such a high order result it is obvious that direct algebraic methods would be
exceedingly laborious.

For this, the normal, term of $\kappa(4^2)$ or $S(4^2)$, there is not a great deal to choose
between the formulae (7) and (8) on the ground of simplicity. It is perhaps
instructive to choose another term in the formula to show a more striking
difference. Suppose we are required to find the term in $\kappa_4^2$. For $\kappa(4^2)$ this is
simply 34/(n - 1), being derived from the two patterns

$$\begin{array}{cc|c} 3 & 1 & 4 \\ 1 & 3 & 4 \\ \hline 4 & 4 & \end{array} \qquad \text{and} \qquad \begin{array}{cc|c} 2 & 2 & 4 \\ 2 & 2 & 4 \\ \hline 4 & 4 & \end{array}$$

which can be set up in 16 and 18 ways respectively, and with coefficient 1/(n - 1)
in each case. To find from (6) the term in $\kappa_4^2$ of $S(4^2)$ we require in addition the
following results:

$$\kappa(4\,2^2) = \ldots\; 4(7n - 10)\,\kappa_4^2/\{n(n-1)^2\}\; \ldots,$$
$$\kappa(2^4) = \ldots\; 8(4n^2 - 9n + 6)\,\kappa_4^2/\{n^2(n-1)^3\}\; \ldots,$$
$$\kappa(2^2) = \kappa_4/n\; \ldots.$$

These are taken from formulae nos. (12), (14) and (1) of Fisher's paper, already
cited.

After substituting in (6) and reducing as far as practicable, we have, for the
required term in $\kappa_4^2$,

$$2(n-1)(17n^4 - 111n^3 + 309n^2 - 405n + 207)\,\kappa_4^2/n^6.$$
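
The reduction to a single quartic is the laborious part of this step, and it can be cross-checked numerically. The following sketch (function name ours) inserts the three quoted Fisher terms, together with the 34/(n - 1) term of $\kappa(4^2)$, into relation (6) and keeps only what multiplies $\kappa_4^2$; the contribution $2\kappa^2(2^2)$ supplies $2/n^2$.

```python
from fractions import Fraction


def S_4sq_kappa4sq_term(n):
    """Coefficient of kappa_4^2 in S(4^2), assembled from relation (6)."""
    n = Fraction(n)
    p = (n - 1) * (n - 2) * (n - 3) / (n**2 * (n + 1))
    q = 3 * (n - 1) ** 3 / (n**2 * (n + 1))
    from_k44 = 34 / (n - 1)                                      # kappa(4^2)
    from_k422 = 4 * (7 * n - 10) / (n * (n - 1) ** 2)            # kappa(4 2^2)
    from_k2222 = 8 * (4 * n**2 - 9 * n + 6) / (n**2 * (n - 1) ** 3)  # kappa(2^4)
    from_k22_squared = 2 / n**2                                  # 2 * kappa(2^2)^2
    return (p**2 * from_k44 + 2 * p * q * from_k422
            + q**2 * (from_k2222 + from_k22_squared))
```

Evaluated at successive integers n, the result coincides with $2(n-1)(17n^4 - 111n^3 + 309n^2 - 405n + 207)/n^6$, which is how the power of n in the denominator of the printed term may be confirmed.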





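So much, then, for the direct semi-invariants of a single moment or semi-invariant
estimate. When we come to consider the simultaneous distribution of
two or more sample estimates, the procedure is a little modified. To serve as an
illustration of method let us find $\kappa(2\,4)$, and from it $S(2\,4)$. The first of these is
the first order product semi-invariant (or moment) of the joint distribution of $k_2$
and $k_4$ (in fact its $\kappa_{11}$). The second is the corresponding parameter of the joint
distribution of $m_2$ and $m_4$. For $\kappa(2\,4)$ the required patterns are

$$\begin{array}{cc|c} 2 & 4 & 6 \\ \hline 2 & 4 & \end{array} \qquad \begin{array}{cc|c} 1 & 3 & 4 \\ 1 & 1 & 2 \\ \hline 2 & 4 & \end{array} \qquad \begin{array}{cc|c} 1 & 2 & 3 \\ 1 & 2 & 3 \\ \hline 2 & 4 & \end{array}$$

$$\kappa(2\,4) = \kappa_6/n + 8\kappa_4\kappa_2/(n-1) + 6\kappa_3^2/(n-1).$$

To transform this into $S(2\,4)$ we shall also require $\kappa(2^2)$ and $\kappa(2^3)$,

$$\begin{array}{cc|c} 2 & 2 & 4 \\ \hline 2 & 2 & \end{array} \qquad \begin{array}{cc|c} 1 & 1 & 2 \\ 1 & 1 & 2 \\ \hline 2 & 2 & \end{array}$$

$$\kappa(2^2) = \kappa_4/n + 2\kappa_2^2/(n-1),$$

$$\begin{array}{ccc|c} 2 & 2 & 2 & 6 \\ \hline 2 & 2 & 2 & \end{array} \qquad \begin{array}{ccc|c} 2 & 1 & 1 & 4 \\ \cdot & 1 & 1 & 2 \\ \hline 2 & 2 & 2 & \end{array} \qquad \begin{array}{ccc|c} 1 & 1 & 1 & 3 \\ 1 & 1 & 1 & 3 \\ \hline 2 & 2 & 2 & \end{array} \qquad \begin{array}{ccc|c} 1 & 1 & \cdot & 2 \\ 1 & \cdot & 1 & 2 \\ \cdot & 1 & 1 & 2 \\ \hline 2 & 2 & 2 & \end{array}$$

$$\kappa(2^3) = \kappa_6/n^2 + 12\kappa_4\kappa_2/\{n(n-1)\} + 4(n-2)\,\kappa_3^2/\{n(n-1)^2\} + 8\kappa_2^3/(n-1)^2.$$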

We now consider the simultaneous transformations

$$m_4 = p k_4 + q k_2^2,$$
$$m_2 = r k_2,$$

where
$$p = (n-1)(n-2)(n-3)/\{n^2(n+1)\},$$
$$q = 3(n-1)^3/\{n^2(n+1)\},$$
$$r = (n-1)/n.$$

The $\mu$-generator of the simultaneous distribution is given by

$$\exp\{\tau_1(p\,\partial/\partial t_1 + q\,\partial^2/\partial t_2^2) + \tau_2\,r\,\partial/\partial t_2\},$$

and the operand is, as before,

$$1 + \mu(4)\,t_1 + \mu(2)\,t_2 + \mu(4^2)\,\frac{t_1^2}{2!} + \mu(4\,2)\,t_1 t_2 + \mu(2^2)\,\frac{t_2^2}{2!} + \ldots.$$

Performing the differentiations and then putting $t_1 = t_2 = 0$, we have, for the
$\mu$-generator,

$$1 + \tau_1\{p\,\mu(4) + q\,\mu(2^2)\} + \tau_2\,r\,\mu(2) + \tau_1\tau_2\{pr\,\mu(4\,2) + qr\,\mu(2^3)\} + \ldots.$$

The $\kappa$-generator is the logarithm of this, and to find the (1 1) product moment of
$m_4$ and $m_2$ we require to expand the logarithm and find the term in $\tau_1\tau_2$. Quite
obviously this is

$$pr\,\mu(4\,2) + qr\,\mu(2^3) - \{p\,\mu(4) + q\,\mu(2^2)\}\,r\,\mu(2)$$
$$= pr\,\{\mu(4\,2) - \mu(4)\,\mu(2)\} + qr\,\{\mu(2^3) - \mu(2^2)\,\mu(2)\}.$$





On converting $\mu$'s to $\kappa$'s this becomes

$$S(2\,4) = pr\,\kappa(4\,2) + qr\,\{\kappa(2^3) + 2\kappa_2\,\kappa(2^2)\}.$$

On substitution of the values of p, q and r and of the expressions given above for
the semi-invariants involved, we get

$$S(2\,4) = \frac{(n-1)^2(n^2-3n+3)\,\kappa_6}{n^5} + \frac{2(n-1)(7n^2-18n+15)\,\kappa_4\kappa_2}{n^4} + \frac{6(n-1)(n-2)^2\,\kappa_3^2}{n^4} + \frac{12(n-1)^2\,\kappa_2^3}{n^3},$$

as already given by Craig and St Georgescu.
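
The passage from $\kappa(4\,2)$, $\kappa(2^3)$ and $\kappa(2^2)$ to $S(2\,4)$ is a term-by-term collection of coefficients, which the following sketch carries out in exact arithmetic. The function name and the dictionary keys are our own labels; the Fisher coefficients are those quoted in the text.

```python
from fractions import Fraction


def S_24_coeffs(n):
    """Coefficients of S(2 4) = p r k(4 2) + q r {k(2^3) + 2 k2 k(2^2)}."""
    n = Fraction(n)
    p = (n - 1) * (n - 2) * (n - 3) / (n**2 * (n + 1))
    q = 3 * (n - 1) ** 3 / (n**2 * (n + 1))
    r = (n - 1) / n
    k42 = {"k6": 1 / n, "k4*k2": 8 / (n - 1), "k3^2": 6 / (n - 1)}
    k222 = {"k6": 1 / n**2, "k4*k2": 12 / (n * (n - 1)),
            "k3^2": 4 * (n - 2) / (n * (n - 1) ** 2), "k2^3": 8 / (n - 1) ** 2}
    k22 = {"k4": 1 / n, "k2^2": 2 / (n - 1)}
    return {
        "k6": p * r * k42["k6"] + q * r * k222["k6"],
        # 2*k2*kappa(2^2) turns k4 into k4*k2 and k2^2 into k2^3
        "k4*k2": p * r * k42["k4*k2"] + q * r * (k222["k4*k2"] + 2 * k22["k4"]),
        "k3^2": p * r * k42["k3^2"] + q * r * k222["k3^2"],
        "k2^3": q * r * (k222["k2^3"] + 2 * k22["k2^2"]),
    }
```

The factor (n + 1) introduced by p and q cancels in every term, which is why none of the four final coefficients carries it.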

Enough has been done to indicate the procedure to be followed in the case of
more complex formulae. In all cases the simplest final results are obtained when
we deal with the semi-invariants of the k-estimates, which fact renders them
particularly suitable when a storehouse of information is required for reference
purposes. In addition, anyone who has mastered the technique may work out
ab initio many of the Fisher results in a few minutes, a procedure that will often
be quicker than looking up the required formula. In such cases the transformation
which has been described will be useful when the results are required in other
forms.

Two results of high order, namely $S(4^4)$ and $S(3^6)$, have been given by
St Georgescu for the special case of the parent population being normal. The first
agrees with the corresponding result given earlier by Craig, which, as we have said,
checks with the Fisher result for $\kappa(4^4)$*. But it ought to be pointed out that the
St Georgescu result for $S(3^6)$ is in error. $\kappa(3^6)$ was first worked out in full for
the normal case some two years ago by Dr Fisher and the present writer*. The
result is

$$\kappa(3^6) = 466560\,n^3(22n^2 - 111n + 142)\,\kappa_2^9/\{(n-1)^5(n-2)^5\} \qquad (9).$$

Now since $m_3 = (n-1)(n-2)\,k_3/n^2$, we may obtain $S(3^6)$ by multiplying the
above result by $\{(n-1)(n-2)/n^2\}^6$, and we have

$$S(3^6) = 466560\,(n-1)(n-2)(22n^2 - 111n + 142)\,\kappa_2^9/n^9 \qquad (10).$$

If we write n for N + 1 in the St Georgescu result, we get

$$3265920\,(n-1)(n-2)(4n^2 - 21n + 28)\,\kappa_2^9/n^9.$$
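
The step from (9) to (10), and the disagreement with the St Georgescu expression, can both be exhibited numerically. The sketch below is an illustration with hypothetical function names; each function returns the coefficient of $\kappa_2^9$ as an exact fraction.

```python
from fractions import Fraction


def kappa_3_6_normal(n):
    """Coefficient of kappa_2^9 in formula (9); requires n > 2."""
    n = Fraction(n)
    return (466560 * n**3 * (22 * n**2 - 111 * n + 142)
            / ((n - 1) ** 5 * (n - 2) ** 5))


def S_3_6_normal(n):
    """Formula (9) multiplied by {(n-1)(n-2)/n^2}^6, as the text prescribes."""
    n = Fraction(n)
    return kappa_3_6_normal(n) * ((n - 1) * (n - 2) / n**2) ** 6


def S_3_6_st_georgescu(n):
    """The St Georgescu expression after writing n for N + 1."""
    n = Fraction(n)
    return 3265920 * (n - 1) * (n - 2) * (4 * n**2 - 21 * n + 28) / n**9
```

Since 3265920 = 7 x 466560, the two quadratics to be compared are $22n^2 - 111n + 142$ and $28n^2 - 147n + 196$; their difference is $6(n-3)^2$, so the two expressions coincide at n = 3 and differ for every larger sample.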

There is no doubt remaining as to the correctness of (9), and therefore (10). It was
determined on more than one occasion, and has recently been reworked, by the
combinatorial method, and carefully checked. A further check is provided by the
relationship between $\mu(3^6)$ and $\mu(3^6\,2^{-9})$, which latter is the sixth moment of the distribution
of the ratio $g_1 = k_3/k_2^{3/2}$, the first measure of departure from normality, differing by
only a constant from $\sqrt{\beta_1}$. Fisher's recurrence relation for $g_1$† enables one, after
some fairly heavy algebra, to obtain the second, fourth and sixth moments of $g_1$ in
succession, and so to check $\kappa(3^6)$. This also has been done more than once, with
the same result each time‡.

* See J. Wishart: Biometrika, xxii. 1930, p. 237.

† R. A. Fisher: Proc. Roy. Soc. A, 130, 1930, pp. 16-28.

‡ J. Pepper, in Biometrika, xxiv. 1932, p. 60, has calculated $\mu(3^8\,2^{-12})$, the result being checked by
Dr Fisher by his combinatorial method.





To fill a gap in St Georgescu's table of results, I have worked out $\kappa(3\,2^4)$, from
which $S(3\,2^4)$ may be directly derived by multiplying by $(n-1)^5(n-2)/n^6$. The
former was obtained by the combinatorial method, and carefully checked by the
rather lengthy process of direct algebra. The result is

$$\begin{aligned}
\kappa(3\,2^4) ={}& \frac{\kappa_{11}}{n^4} + \frac{48\,\kappa_9\kappa_2}{n^3(n-1)} + \frac{8(16n-29)\,\kappa_8\kappa_3}{n^3(n-1)^2} \\
&+ \frac{8(38n^2-99n+75)\,\kappa_7\kappa_4}{n^3(n-1)^3} \\
&+ \frac{16(26n^3-98n^2+127n-58)\,\kappa_6\kappa_5}{n^3(n-1)^4} \\
&+ \frac{720\,\kappa_7\kappa_2^2}{n^2(n-1)^2} + \frac{96(31n-53)\,\kappa_6\kappa_3\kappa_2}{n^2(n-1)^3} \\
&+ \frac{576(9n^2-23n+16)\,\kappa_5\kappa_4\kappa_2}{n^2(n-1)^4} \\
&+ \frac{288(9n^2-32n+26)\,\kappa_5\kappa_3^2}{n^2(n-1)^4} \\
&+ \frac{96(41n^2-129n+111)\,\kappa_4^2\kappa_3}{n^2(n-1)^4} \\
&+ \frac{3840\,\kappa_5\kappa_2^3}{n(n-1)^3} + \frac{8640(2n-3)\,\kappa_4\kappa_3\kappa_2^2}{n(n-1)^4} \\
&+ \frac{1152(5n-12)\,\kappa_3^3\kappa_2}{n(n-1)^4} + \frac{5760\,\kappa_3\kappa_2^4}{(n-1)^4} \qquad (11).
\end{aligned}$$

One point of some interest that arises out of the derivation of such high order
formulae is that under certain circumstances part of the result at any rate can be
derived by the application of simple formulae from corresponding terms of results
of lower degree that have been already evaluated. This point has not been studied
at all systematically, and in fact a wide field of study awaits the research worker
who cares to take the problem up. The formulae arise from consideration of the
way in which patterns of a certain size may be expanded by the addition of new
columns and rows, and of the number of ways in which such a change can be
effected. One or two results of this character have already been given, but only
for the case of the parent population being normal. Thus we have the result given
in a previous paper* that

[formula (12): illegible in this copy]

which gives the normal term of $\kappa(p^q\,2^r)$, which will only exist for pq even, in terms
of that of $\kappa(p^q)$. This relation should in fact have been extended to $\kappa(3^a 4^b 5^c \ldots 2^r)$,
where $3a + 4b + 5c + \ldots = pq$, as has been indicated by St Georgescu† with the
parallel formula for his form. A particular case is the well-known one, putting
p = 2, q = 1,

$$\kappa(2^{r+1}) = 2^r\,r!\,\kappa_2^{r+1}/(n-1)^r \qquad (13),$$

giving immediately the normal term of the (r + 1)th semi-invariant of the distribution
of $k_2$.

An extension of this work to the case of non-normal populations can be made
under certain circumstances. It is obvious that if any term of a formula which
involves a power of $\kappa_2$ can be derived from the corresponding term of a formula
of lower degree, then this will enormously reduce the number of terms to be
evaluated by combinatorial methods. In particular it will seldom be needful to
evaluate patterns of large size in which most of the cells are unoccupied, a type
to which it is usually somewhat difficult to assign numerical coefficients. For
example 8 out of the 14 terms in $\kappa(3\,2^4)$ contain a $\kappa_2$, leaving only 6, 4 of which

* J. Wishart: Biometrika, xxii. 1930, p. 234. † Loc. cit. p. 97.





are very easy, to be directly determined. Again $\kappa(2^7)$ contains 34 terms, 21 of
which contain at least one $\kappa_2$. A formula which embraces all such terms from
formulae of the type $\kappa(p\,2^r)$ is the following:

The term in $\kappa_2^{r-s+1}\kappa_3^a\kappa_4^b\kappa_5^c \ldots$ in $\kappa(p\,2^r)$ is obtained from the term in
$\kappa_3^a\kappa_4^b\kappa_5^c \ldots$ in $\kappa(p\,2^{s-1})$ by multiplying the coefficient of the latter by

$$\frac{2^{r-s+1}\,r!\,(r+p-1)!}{(s-1)!\,(s+p-2)!\,(r-s+1)!\,(n-1)^{r-s+1}} \qquad (14).$$

We have, of course, that $3a + 4b + 5c + \ldots = 2(s-1) + p$, and p may take any
integral value from 2 upwards. The formula is subject to the one exception that
owing to symmetry it has to be slightly modified to give the normal term in
$\kappa(2\,2^r)$, but for this we have formula (13) above. The other terms in $\kappa(2^{r+1})$ are
given correctly by (14).
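
Formula (14) is simple enough to tabulate directly. The sketch below (the function name `pattern_multiplier` is ours) returns the numerical part of the multiplier together with the power of (n - 1) that accompanies it, so that the factor to apply is the returned number divided by (n - 1) raised to the returned exponent.

```python
from fractions import Fraction
from math import factorial


def pattern_multiplier(p, r, s):
    """Numerical factor and power of (n - 1) in formula (14).

    Carries a term of kappa(p 2^(s-1)) into the term of kappa(p 2^r)
    containing the extra factor kappa_2^(r-s+1). Requires 1 <= s <= r.
    """
    e = r - s + 1
    numerator = 2**e * factorial(r) * factorial(r + p - 1)
    denominator = factorial(s - 1) * factorial(s + p - 2) * factorial(e)
    return Fraction(numerator, denominator), e
```

With p = 3 and r = 4 the values s = 4, 3, 2, 1 give 48, 720, 3840 and 5760, the numerical coefficients of the terms in $\kappa_9\kappa_2$, $\kappa_7\kappa_2^2$, $\kappa_5\kappa_2^3$ and $\kappa_3\kappa_2^4$ of formula (11), each of which descends from a leading term with unit numerical coefficient.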

As an illustration let it be desired to find the term in $\kappa_5\kappa_3^2\kappa_2^2$ in $\kappa(3\,2^6)$. Here
a = 2, b = 0, c = 1, p = 3 and r = 6. It follows that s = 5, and the required term
will be obtained from that in $\kappa_5\kappa_3^2$ of $\kappa(3\,2^4)$ by multiplying the coefficient of the
latter by

$$\frac{2^2\;6!\;8!}{4!\;6!\;2!\,(n-1)^2}$$

or by $3360/(n-1)^2$. As $\kappa(3\,2^4)$ is given by our formula (11) we see that the
required term is

$$967680\,(9n^2 - 32n + 26)\,\kappa_5\kappa_3^2\kappa_2^2/\{n^2(n-1)^6\}.$$

A further application of the rule gives us, for the last three terms in $\kappa(2^r)$,

$$\kappa(2^r) = \ldots + 2^{r-2}\,\frac{r(r-1)}{2!}\,\frac{(r-1)!}{n(n-1)^{r-2}}\,\kappa_4\kappa_2^{r-2} + 2^{r-2}\,\frac{r(r-1)(r-2)}{3!}\,\frac{(r-1)!\,(n-2)}{n(n-1)^{r-1}}\,\kappa_3^2\kappa_2^{r-3} + 2^{r-1}\,\frac{(r-1)!}{(n-1)^{r-1}}\,\kappa_2^r.$$

Suppose, then, we have a population which is removed to some extent, but not
greatly, from normality, so that $\kappa_3$ and $\kappa_4$ only exist, the higher semi-invariants
being zero or negligible. If also $\kappa_3$ and $\kappa_4$ are small, so that their squares and
higher powers can be neglected in comparison with their first powers (or second in
the case of $\kappa_3$), then under these conditions we may write

$$\kappa(2^r) = \frac{2^{r-1}(r-1)!\,\kappa_2^r}{(n-1)^{r-1}}\left\{1 + \frac{1}{2}\,\frac{n-1}{n}\,\frac{r(r-1)}{2!}\,\gamma_2 + \frac{1}{2}\,\frac{n-2}{n}\,\frac{r(r-1)(r-2)}{3!}\,\gamma_1^2\right\}$$

approximately, where $\gamma_1 = \kappa_3\kappa_2^{-3/2}$ and $\gamma_2 = \kappa_4\kappa_2^{-2}$. $\gamma_1$ and $\gamma_2$ are the $\sqrt{\beta_1}$ and $\beta_2 - 3$
of a more familiar notation. Such a formula may serve to indicate in what way
the distribution of $k_2$, the estimate of variance, changes for a slight departure from
normality of the parent population. The case considered may, however, be of
limited interest, but it does not seem as if much progress will be made in determining
the distribution of the estimated variance in samples from non-normal
populations except under certain simplifying assumptions as to the nature of the
moment or semi-invariant law. With our present knowledge it would be possible
to write down quite a large number of the terms in $\kappa(2^r)$, but it is not clear that
any useful purpose would be served by doing so.
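
To see the size of the correction for a given departure from normality, the approximate expression for $\kappa(2^r)$ may be evaluated directly. The sketch below is an illustration only; the function name and the choice to pass $\gamma_1^2$ rather than $\gamma_1$ (so that rational inputs stay exact) are ours.

```python
from fractions import Fraction
from math import factorial


def kappa_2r_approx(r, n, kappa2, gamma1_sq, gamma2):
    """Approximate r-th semi-invariant of k2 for a nearly normal parent.

    gamma1_sq is kappa_3^2 / kappa_2^3 and gamma2 is kappa_4 / kappa_2^2.
    """
    n = Fraction(n)
    normal_term = (2 ** (r - 1) * factorial(r - 1) * Fraction(kappa2) ** r
                   / (n - 1) ** (r - 1))
    correction = (1
                  + Fraction(r * (r - 1), 4) * (n - 1) / n * gamma2
                  + Fraction(r * (r - 1) * (r - 2), 12) * (n - 2) / n * gamma1_sq)
    return normal_term * correction
```

For r = 2 the expression reduces to $\kappa_4/n + 2\kappa_2^2/(n-1)$, the exact variance of $k_2$, and for r = 3 it reproduces $\kappa(2^3)$ apart from the term in $\kappa_6$, which the stated conditions treat as negligible.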



AN EMPIRICAL AGE SCALE. 


By DRYSDALE ANDERSON, M.R.C.S., L.R.C.P., D.P.H., M.O.H., 

West African Medical Staff. 

General. 

In many tropical countries the opening up to modern methods is very recent ; 
annoying gaps occur in essential information in unexpected places. When carrying 
out work on the subject of vital statistics, knowledge of the different age groups of 
the population is fundamental and yet more often than not it is impossible to obtain 
even approximate figures. 

A study was recently being made among one of the large race units in Southern 
Nigeria, the Yoruba nation, on the subject of malarial endemicity; in this work the 
enlargement of the spleen is of importance when read in the light of the child’s 
age. As with many another native people, the idea of birth registration or that of 
counting numbers of persons is contrary to the general ideas of good luck, and con- 
sequently it is very rare to find a person who knows his or her age. Until an age 
scale could be devised, the work referred to was brought to a standstill. 

Anthropometrical research has been carried out on a large scale in many parts 
of the world with different peoples. Among these the American Negro has been 
extensively weighed and measured, and as birth registration is fairly widespread in 
the United States the ages are also known. In times past, the Yoruba nation has 
contributed in a considerable degree to the present coloured population of North 
America. 

The Method Adopted. 

As many Yoruba children as could be obtained in the schools of Abeokuta, a 
representative town in the middle of the national area, were measured for weight 
and height, and the one was plotted against the other. A second graph was then 
drawn from the weights and heights of the American coloured children at the age 
units. The curves were not unlike in general appearance. 

As a check, a similar set of curves was drawn for English, Australian and American 
white children. It will be seen from Fig. 1 that there is a considerable amount of 
variation in the five curves which cannot be accounted for by the different methods 
of measurement such as the amount of clothes worn. This variation is most striking 
in the younger ages of English children. Also it is obvious that the curves for the 
two African peoples are as significantly different the one from the other as are 
either one of them from any of the white curves. 

If the American coloured curve had fitted that of the Yorubas, one would have 
been fairly justified in inserting the age points on the latter as on the former, but 



[Figure: Determination of Age from Height and Weight (Boys). Axis label: Height in inches.]





this procedure is impracticable with the two curves as different in detail as they
are. On inspecting the five curves, one thing emerges; age points for all of them
appear to be in straight bands lying radially across the curves. The difficulty is
where to locate in these bands the average age on the Yoruba curve. There is no
information available to act as a criterion to a method for choosing a particular set
of points.

As the inquiry had a practical rather than a theoretical end in view, that of
assigning an age to a child without too large a possible error, the following method
was formulated. It probably has no greater virtue than any other which might be
devised, but to those carrying on the work it appeared the simplest and most
satisfactory although altogether empirical.

On this understanding the American white curve, being the most regular of the
four curves of known age, was taken as the index. At all age points on it, the angle
between the two contiguous straight lines was bisected by lines A; parallel lines
B₁ and B₂ were then drawn through the corresponding age points farthest apart on
the curves of known age. In each group the distance between B₁ and B₂ was
bisected and an indicator line C drawn through the point of bisection, parallel with
A, B₁ and B₂, to cut the Yoruba curve at a point which became the empirical age
point.

The Yoruba Curve. 

This could be drawn from two sets of data, either those obtained from working 
out the average heights of weight groups, or from those of the average weights of 
height groups. Both sets were worked out and the two graphs drawn. There is a 
slight difference between the two due to the larger number of the weight groups 
as compared with those of height*. 

Although this difference becomes more conspicuous at the upper extreme, it is 
outside the school age (six to sixteen years) and is of no importance. The variation 
at the young end can be accounted for by considering the group table. It will be 
seen that there are obviously persons of lower weight than 30+ lbs. in the height 
groups 39+ ins. and 42+ ins. which have been left out. This may be due to a parent 
judging whether a child is old enough to go to school from his height. One does 
not know, this is merely a suggestion. The points on one curve are relatively too 
heavy for the heights in these two groups. 

To differentiate between the two curves they have been named differently. That
plotted from the average heights to weight groups has been named the "Height-Weight"
curve, whereas that from the average weights to height groups has been
called the "Weight-Height" curve. As the probable errors of the mean weights of
the height groups are much larger than the probable errors of the mean heights of
the weight groups, the Height-Weight curve was chosen as that from which to work.
Incidentally it will be noticed that the probable errors of the mean weights become
so large at the upper extremes as to show the result in one case to be non-significant.
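
The mean of a group and its probable error can be computed from a grouped frequency distribution. The sketch below is a hedged illustration rather than the author's procedure. It takes the probable error as 0.6745 times the standard error of the mean (the classical definition), places each 3-inch height class at its mid-point, and assumes that the first height class of Table I begins at 36 inches; all three are assumptions on our part. The counts are those of the 35+ lb. weight group in Table I.

```python
from math import sqrt


def grouped_mean_and_pe(midpoints, counts):
    """Mean and probable error of the mean from grouped frequencies."""
    n = sum(counts)
    mean = sum(m * c for m, c in zip(midpoints, counts)) / n
    variance = sum(c * (m - mean) ** 2 for m, c in zip(midpoints, counts)) / n
    probable_error = 0.6745 * sqrt(variance / n)
    return mean, probable_error


# Weight group 35+ lb.: 1, 7, 17 and 13 boys in four successive height classes.
# Mid-points of 3-inch classes starting at 36 in. (assumed, see lead-in).
heights = [37.5, 40.5, 43.5, 46.5]
boys = [1, 7, 17, 13]
mean_height, pe_height = grouped_mean_and_pe(heights, boys)
```

The figures so obtained lie close to, but need not coincide with, the entries of Table II, which may have been calculated from the ungrouped measurements.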

[* The two regression curves should not coincide, and the reason for their non-coincidence lies in the
nature of regression curves, and cannot be attributed to the cause suggested by the author. Ed.]






Fig. 2. Scale on the pole running from 3 ft. 7 in. to 5 ft. 6 in. to determine from height of a boy
his supposed age. (Reduced in the ratio of 92 to 53.)


Discussion of the Sources of Information. 

The Australian Curve. The data are taken from Part II of the Reports of the Principal
Medical Officer of the Commonwealth Department of Education for 1918 and 1919, from
the work of H. Sutton, "Rate and Growth of Australian Children," and of F. A. Mecham,
"Physical Condition of Children attending Public Schools in New South Wales." The
figures have been extracted in six monthly age groups, and the age groups 6+, 7+,
etc., used. An adjustment must therefore be made for the age points. An age group
contains all the children of ages from x years to x years plus six months. The
mid-point is therefore x years plus three months. The reading on the curve is really at
the point x plus ¼. The age point has thus to be set back one-quarter of the distance
towards the previous one. It is definitely stated by the authors that all children were
weighed without clothes.

The American Coloured Curve. The information 
is provided by A. MacDonald in his Experimental 
Study of Children , 1899, in which the exact age is 
taken as the age point. There is no statement as 
to the clothes worn. 

The American White Curve. The same source 
as that of the American Coloured Curve provided 
the data used. 

The English Curve. Information was obtained from the Report of the Anthropometric
Committee of the British Association for the Advancement of Science in their
Proceedings for 1879, in which Roberts's Manual of Anthropometry is quoted. The age
points are ages at last birthday. By the same reasoning as that used above with the
Australian data, it is necessary to set back the age points by half the distance to the
immediately preceding ones. There is no statement as to clothing, if any, which was
worn, or whether measurements were taken barefoot or not.

The Yoruba Curve. Children were measured barefoot and were dressed in light cotton
drill shorts and a jumper of the same material.

The Scale. 

This consists of a strip of stiff paper two feet long. See Fig. 2, where it has been
needful to divide the scale for publication into two parts at age 12 years. With one end
marked "3 ft. 7 in." it is graduated in ages according to the corresponding
heights read off from the Yoruba Height-Weight curve. The shaded areas on
either side of the age heights represent the distance B₁ and B₂ on either side
of C. Any reading coming inside a shaded area may therefore be read as the exact
age in question; readings between the shaded areas are interpolated in quarter
years.

The strip is glued to a flat pole and then varnished with the 3 ft. 7 in. point at
that distance from one end; this end serves as the foot of the scale.


TABLE I.
Yoruba Boys.

Weight in                                  Height in inches
pounds    [36+]   39+   42+   45+   48+   51+   54+   57+   60+   63+   66+   69+   72+   Total

 30+          1     7    10     2     -     -     -     -     -     -     -     -     -      20
 35+          1     7    17    13     -     -     -     -     -     -     -     -     -      38
 40+          -     4    27    39     5     2     -     -     -     -     -     -     -      77
 45+          -     3    10    47    30     1     1     -     -     -     -     -     -      92
 50+          -     -     2    22    58    21     2     3     1     -     -     -     -     109
 55+          -     -     -     3    42    74    14     -     -     -     -     -     -     133
 60+          -     -     -     2     6    53    36     2     -     -     -     -     -      99
 65+          -     -     -     -     3    26    47    21     2     -     -     -     -      99
 70+          -     -     -     -     -    10    65    36     1     1     -     -     -     113
 75+          -     -     -     -     -     1    18    27    10     -     -     -     -      56
 80+          -     -     -     -     -     2     6    47    26     1     -     -     -      82
 85+          -     -     -     -     -     -     1    23    32     4     1     1     -      62
 90+          -     -     -     -     -     -     -     7    22     8     1     -     -      38
 95+          -     -     -     -     -     -     1     4    29    21     1     -     -      56
100+          -     -     -     -     -     -     1     1    24    30     1     -     -      57
105+          -     -     -     -     -     -     -     -    11    41    12     -     -      64
110+          -     -     -     -     -     -     2     3     6    19    12     -     -      42
115+          -     -     -     -     -     -     -     -     4    15    14     ?     ?      34
120+          -     -     -     -     -     -     -     -     3    12    17     -     -      32
125+          -     -     -     -     -     -     -     -     2    11    16     7     -      36
130+          -     -     -     -     -     -     -     1     1     3     9     2     1      17
135+          -     -     -     -     -     -     -     -     -     2     6     3     3      14
140+          -     -     -     -     -     -     -     -     -     1     4     1     -       6
145+          -     -     -     -     -     -     -     -     -     5     7     1     -      13
150+          -     -     -     -     -     -     -     -     -     1     4     -     -       5
155+          -     -     -     -     -     -     -     -     -     1     3     ?     ?       ?
160+          -     -     -     -     -     -     -     -     -     -     ?     ?     ?       ?
165+          -     -     -     -     -     -     -     -     ?     ?     ?     ?     ?       ?
170+          -     -     -     -     -     -     -     -     -     -     ?     ?     ?       ?
175+          -     -     -     -     -     -     -     -     -     -     ?     ?     ?       ?
180+          -     -     -     -     -     -     -     -     -     -     ?     ?     ?       ?

Total         2    21    66   128   144   190   194   175   174   176   114    21     6    1411

? = entry illegible in this copy. The label of the first height column is not legible and is
supplied in brackets from the 3-inch spacing of the remaining columns.



The Girls.

It was seen when the corresponding graph for the girls was drawn that this 
method of obtaining an age scale was inapplicable because of the irregular variations 
which occur about and after the age of puberty. 

Conclusions . 

A method of reading a boy's age from a height scale has been devised. 

Although applicable when children are enumerated in groups or in large numbers, 
the result when applied to the individual will not be inaccurate to such a degree as 
to render it of no value. 

The scale worked out is for boys of the Yoruba nation. It is not known yet 
whether it is applicable to members of other African peoples. 

Note. Any official information contained in this paper is used by the kind
permission of the Director of the Medical and Sanitary Service of Nigeria. The
thanks of the writer are due to Major P. Granville Edge for his valuable co-operation
and criticism during the carrying out of the investigation.

TABLE II.
Yoruba Boys Measurements.

Weight in   Mean Height   P.E. of       Number       Height in   Mean Weight   P.E. of       Number
pounds      in inches     Mean Height   in Group     inches      in pounds     Mean Weight   in Group

 30+          42.15         ±0.31          20          39+          38.21        ±0.76          21
 35+          43.88         ±0.24          38          42+          40.76        ±0.44          66
 40+          45.48         ±0.18          77          45+          46.06        ±0.32         128
 45+          47.12         ±0.17          92          48+          52.88        ±0.25         144
 50+          49.77         ±0.19         109          51+          60.66        ±0.29         190
 55+          51.41         ±0.14         133          54+          69.56        ±0.68         194
 60+          53.35         ±0.15          99          57+          80.72        ±0.50         175
 65+          5?.29         ±0.17          99          60+          94.16        ±0.62         174
 70+          56.32         ±0.13         113          63+         110.74        ±0.72         176
 75+          58.15         ±0.17          66          66+         127.89        ±1.09         114
 80+          59.16         ±0.16          82          69+         136.31        ±2.62          21
 85+          60.63         ±0.07          61
 90+          61.81         ±0.22          38
 95+          62.41         ±0.06          56
100+          62.97         ±0.17          57
105+          64.64           ?            64
110+          64.01         ±0.33          42
115+          65.56         ±0.25          34
120+          65.91         ±0.64          32
125+          66.83         ±0.28          36
130+          66.97         ±0.33          17

Total                                    1365

? = figure illegible in this copy.



Drysdale Anderson 


TABLE III.

American White and Coloured Boys' Measurements.

         White                                        Coloured
Age      Height in  Number    Weight in  Number      Height in  Number    Weight in  Number
         inches     in Group  pounds     in Group    inches     in Group  pounds     in Group
 6       44.69       102       42.24      102        44.17        73       43.44       69
 7       45.93       395       47.66      399        46.08       246       50.10      226
 8       47.81       603       51.42      605        47.74       280       53.99      276
 9       49.76       640       56.18      638        49.26       294       59.04      287
10       51.60       698       61.57      696        53.14       333       65.17      228
11       53.19       660       66.14      668        52.10       268       69.44      270
12       55.15       746       72.70      751        53.94       283       75.97      278
13       56.68       677       79.34      683        56.08       318       83.50      317
14       59.32       586       88.75      582        57.98       280       90.90      275
15       61.85       403      100.91      398        60.09       218       99.42      215
16       64.31       263      113.83      263        63.13       124      113.45      121
17       65.97       119      121.18      119          -           -         -          -
18       67.06        38      133.99       38          -           -         -          -
Totals     -        5930         -       5942          -        2717         -       2562



TABLE IV.

English and Australian Boys' Measurements.

         English                                      Australian
Age      Height in  Number    Weight in  Number      Height in  Number    Weight in  Number
         inches     in Group  pounds     in Group    inches     in Group  pounds     in Group
 5       41.16       175       49.99      176          -           -         -          -
 6       43.18       327       54.19      327        44.07      3906       43.37     3906
 7       45.58       784       68.89      631        45.88      6070       47.04     6070
 8       47.15      1052       59.50     1038        48.01      7171       51.91     7171
 9       49.70      1241       62.29     1262        50.08      6174       56.99     6174
10       51.79      1193       66.87     1200        51.74      6902       61.49     6902
11       53.21      1230       71.20     1129        53.63      6370       66.55     6370
12       54.98       868       77.00      863        55.23      6009       72.29     6009
13       57.36      1464       83.43     1527        57.27      5758       78.80     5758
14       59.43      2424       91.91     2571        59.34      3378       88.16     3378
15       62.02      1297      103.60     1451        61.76      1144       92.22     1144
16       64.66      1704      118.52     1724        64.38       420      111.43      420
17       66.15      2055      131.28     2106          -           -         -          -
18       66.88      1675      137.57     1669          -           -         -          -
Totals     -       17489         -      17674          -       53302         -      53302






TABLE V.

Yoruba Boys: Age from Height-Weight Curve.

Age      Height in      P.E.
         inches
 5         -              -
 6       44.27          ±0.83
 7       46.10          ±0.62
 8       47.10          ±0.35
 9       49.76          ±0.40
10       51.73          ±0.22
11       53.35          ±0.25
12       55.41          ±0.26
13       57.17          ±0.20
14       59.59          ±0.38
15       62.09          ±0.40
16       64.68          ±0.50
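
As a rough illustration of how a scale of this kind is read, the sketch below interpolates between the tabulated mean heights to return an age for a given height. The function name `age_from_height` and the use of straight-line interpolation between adjacent ages are assumptions made for the example and are not taken from the paper; the heights for ages 11 and 12 are read as 53.35 and 55.41 inches.

```python
from bisect import bisect_right

# Mean height in inches at ages 6 to 16, as read from Table V.
AGES = list(range(6, 17))
HEIGHTS = [44.27, 46.10, 47.10, 49.76, 51.73, 53.35,
           55.41, 57.17, 59.59, 62.09, 64.68]

def age_from_height(height_in):
    """Return an interpolated age in years, or None outside the tabulated range."""
    if not HEIGHTS[0] <= height_in <= HEIGHTS[-1]:
        return None
    # Index of the upper end of the bracketing interval.
    i = min(bisect_right(HEIGHTS, height_in), len(HEIGHTS) - 1)
    lo_h, hi_h = HEIGHTS[i - 1], HEIGHTS[i]
    return AGES[i - 1] + (height_in - lo_h) / (hi_h - lo_h)

print(age_from_height(50.0))
```

A height outside the tabulated range returns `None` rather than an extrapolated age, since the scale gives no information there.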


N.B. The following data were sent at a later date to the Editor of Biometrika
by Dr Anderson. They allow a comparison between the Height-Weight curves of
two Nigerian races to be made.


TABLE VI.

Height-Weight Curve for Ibo Boys.

Weight in    Height in    P.E.       Number in
pounds       inches                  Group
 30+         41.09        ±0.24        17
 35+         43.97        ±0.12        61
 40+         45.41        ±0.14        88
 45+         47.28        ±0.13       118
 50+         49.59        ±0.08       130
 55+         51.08        ±0.10       169
 60+         53.07        ±0.10       145
 65+         54.81        ±0.11       109
 70+         55.95        ±0.10       122
 75+         57.06        ±0.12        96
 80+         58.47        ±0.10        87
 85+         59.73        ±0.14        91
 90+         60.96        ±0.14        73
 95+         61.59        ±0.17        88
100+         62.64        ±0.13        81
105+         62.94        ±0.16        69
110+         64.89        ±0.16        79
115+         65.10        ±0.14        76
120+         65.58        ±0.14        73
125+         66.09        ±0.16        62
130+         66.69        ±0.20        40
135+         66.66        ±0.24        26
140+         67.68        ±0.33        16
145+         68.01        ±0.57        10
Total                                1925






These measurements indicate a very considerable difference between the
Height-Weight curves of the Ibo and Yoruba Boys. They agree moderately up to 
age 11, but then the Ibo rises above the Yoruba curve, and after age 16 above all 
others but the English. 

[Note. If the three correlation tables, Age and Height, Age and Weight, Weight 
and Height, were known to give straight regression lines, then the probable Age 
for a given Weight and Height would be determined at once for a given race by 
the regression of Age on the two variates Weight and Height, and what is more the 
scatter round this mean or probable age would be given by the usual formula. Now 
what are the difficulties of applying this customary method in the present case ? 

(i) Growth curves are not as a rule linear; they are certainly not so if we take
the Height or Weight from birth to prime*. But if we take only the portion of these
growth curves from age 5 to 14, the curves are approximately straight†.

(ii) Tables of Height and Weight when Age is neglected have as far as the 
annotator is aware not been formed, or are not usually formed. Correlation tables 
of Height and Weight are usually given for each age group; there is no serious 
difficulty in adding such tables together, however, and then working out their 
constants. But will these Height and Weight tables give approximately straight 
regression lines? The Yoruba and Ibo data at least suggest that nothing really fitter 
than straight lines could be found for the range from 5 to 14. 

(iii) The bivariate regression formula can only be applied to races where the 
age at birth is known. We do not know this in the case of the African children. 
To this the answer must be that it is an essential feature of Dr Anderson’s method 
that he applies results from other races and expects no great difference to exist in 
the distributions of Height and Weight with Age between widely different races. 

The special advantage of Dr Anderson’s method is that it may empirically allow 
for some curvature of the regression lines, but what it gains in this way will we 
think be more than compensated by the fact that on the hypothesis of linearity 
the bivariate regression formula is sound theoretically and not empirical, and 
further provides a reasonable measure of the scatter of age round the mean or 
probable value. 
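
The regression of Age on Weight and Height referred to here can be sketched as follows. This is only an illustration of the usual two-variate regression formulae on the hypothesis of linearity: the function name `age_regression` and all numerical constants are hypothetical, since the requisite correlation tables have not yet been formed.

```python
import math

def age_regression(r_aw, r_ah, r_wh, sd_a, sd_w, sd_h):
    """Regression of Age on Weight and Height from the three correlations.

    Returns the coefficient of Weight, the coefficient of Height, and the
    standard deviation of Age about the regression plane.
    """
    denom = 1.0 - r_wh ** 2
    beta_w = (r_aw - r_ah * r_wh) / denom   # standardised partial coefficients
    beta_h = (r_ah - r_aw * r_wh) / denom
    r_squared = beta_w * r_aw + beta_h * r_ah
    scatter = sd_a * math.sqrt(1.0 - r_squared)
    return beta_w * sd_a / sd_w, beta_h * sd_a / sd_h, scatter

# Hypothetical constants, chosen only to exercise the function.
b_w, b_h, scatter = age_regression(r_aw=0.80, r_ah=0.85, r_wh=0.90,
                                   sd_a=2.5, sd_w=15.0, sd_h=5.0)
print(b_w, b_h, scatter)
```

The probable Age for a given Weight W and Height H would then be the mean Age plus `b_w` times the deviation of W from its mean plus `b_h` times the deviation of H from its mean, with `scatter` measuring the spread of individual ages about that value.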

It will be of interest accordingly to determine the regression equation of Age 
on Weight and Height for two fairly diverse races : 

(i) A white race, say English, Scottish or American. The first two will require 
only one further correlation table to be made for each. Until one has examined 
MacDonald’s data it is not possible to say whether it contains for American
children the requisite material for the Height-Weight correlation table.

(ii) A coloured race, say the American negro children. Again one cannot say 
until MacDonald’s work is examined whether the requisite material is available. 

* Cf. Annals of Eugenics, Vol. II. pp. 100-102.

† Cf. reference in last footnote, and, better, Biometrika, Vol. X. pp. 292-298. Note especially the
degree of linearity in the Glasgow children. The B. A. returns are based on very heterogeneous material.




Having obtained the two bivariate formulae we can then predict from both 
formulae the age of various white and coloured children of selected heights and 
weights and test how far the results are independent of which formula is used, i.e. 
of race. If the difference be not very great, we shall then feel justified in applying 
either formula to African children. 

When this work has been completed (and it will take some time to make from
the data the additional tables requisite*) we shall be in a position to test
Dr Anderson’s method, and replace it, if needful, by a more theoretically correct
process.

In this matter it is well to recall to mind that formulae to reconstruct stature 
from the measurement of the long bones (i) appear to vary from race to race to a 
sensible extent, (ii) that the long bones are more highly correlated with stature 
than Weight and Height are with Age, and (iii) that the probable error in stature 
of a single individual (determined from a single set of bones, not that of a race from 
numerous skeletons) is very considerable. One would not be surprised a priori to 
find a standard error of 2 to 3 years in determining a probable Age from Height 
and Weight, and such a range would possibly render the determination of small 
value for Dr Anderson’s purposes. However, that is only a surmise to be verified 
or not after the standard error has been found. Ed.] 

* The writer has not so far been able to see a copy of A. MacDonald’s Experimental Study of
Children, 1899.



THE PROBABILITY INTEGRAL OF THE CORRELATION 
COEFFICIENT IN SAMPLES FROM A NORMAL BI-VARIATE 

POPULATION. 


By F. GARWOOD, B.A. 


1. Samples of size n are drawn from a normally distributed bi-variate population
in which the coefficient of correlation is $\rho$. Dr R. A. Fisher has shown* that the
probability that the coefficient of correlation r of the sample will lie between r and
r + dr is $y_n(r)\,dr$, where

$$y_n(r) = \frac{(1-\rho^2)^{\frac{n-1}{2}}}{\pi\,(n-3)!}\,(1-r^2)^{\frac{n-4}{2}}\,\frac{d^{\,n-2}}{d(r\rho)^{n-2}}\left\{\frac{\cos^{-1}(-\rho r)}{\sqrt{1-\rho^2 r^2}}\right\}.$$

The problem is to evaluate $\int_{-1}^{r} y_n(r)\,dr$, which is the chance of drawing a
sample with a correlation coefficient less than r. We shall obtain the integral for
the first few values of n, a symbolic method for the case of n odd also being shown.

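2. n = 3. We shall use the notation

$$U = \frac{\cos^{-1}(-r\rho)}{\sqrt{1-r^2\rho^2}}, \qquad D \equiv \frac{d}{d(r\rho)}.$$

Thus

$$\frac{\pi}{1-\rho^2}\int_{-1}^{r} y_3\,dr = \int_{-1}^{r}\frac{1}{\sqrt{1-r^2}}\,DU\,dr.$$

Put $r = \sin\theta$; then

$$\frac{\pi}{1-\rho^2}\int_{-1}^{r} y_3\,dr = \int_{-\pi/2}^{\theta} DU\,d\theta.$$

Now we find later (6) that if s > p, then

$$\int_{-\pi/2}^{\theta}\sin^{2p}\theta\,D^{2s-1}U\,d\theta = \left(\theta+\frac{\pi}{2}\right)\frac{\partial^{2p}}{\partial\rho^{2p}}\,\Delta^{s-p-1}\,\frac{\rho^{2s-2p-2}}{1-\rho^2} - \cos\theta\,\frac{\partial^{2p}}{\partial\rho^{2p}}\,\Delta^{s-p-1}\left\{\frac{\rho^{2s-2p-1}}{1-\rho^2}\,(U)_{\rho\sin\theta}\right\},$$

where $\Delta^{s-p-1}$ denotes the operator $\Delta$ of Section 6 applied $s-p-1$ times.

Thus taking p = 0, s = 1, we have

$$\int_{-\pi/2}^{\theta} DU\,d\theta = \frac{1}{1-\rho^2}\left\{\theta+\frac{\pi}{2} - \cos\theta\,\frac{\rho\cos^{-1}(-\rho\sin\theta)}{\sqrt{1-\rho^2\sin^2\theta}}\right\},$$

and since $\cos\left(\theta+\frac{\pi}{2}\right) = -r$, we have

$$\int_{-1}^{r} y_3\,dr = \frac{\cos^{-1}(-r)}{\pi} - \frac{\rho\sqrt{1-r^2}}{\pi}\,\frac{\cos^{-1}(-\rho r)}{\sqrt{1-\rho^2 r^2}}.$$

* Biometrika, Vol. X. p. 507.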
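
The n = 3 result can be checked numerically. The sketch below assumes the closed form is read as $\cos^{-1}(-r)/\pi - \rho\sqrt{1-r^2}\,\cos^{-1}(-\rho r)/(\pi\sqrt{1-\rho^2r^2})$ and compares it with a direct Simpson's-rule evaluation of $\frac{1-\rho^2}{\pi}\int_{-\pi/2}^{\theta} DU\,d\theta$; the function names and the choice of quadrature are illustrative only.

```python
import math

def p3_closed(r, rho):
    """Closed form for the integral of y_3 from -1 to r."""
    u = math.acos(-rho * r) / math.sqrt(1.0 - (rho * r) ** 2)
    return math.acos(-r) / math.pi - rho * math.sqrt(1.0 - r * r) * u / math.pi

def p3_numeric(r, rho, steps=2000):
    """Simpson's rule applied to ((1 - rho^2)/pi) * integral of DU d(theta)."""
    def du(theta):
        x = rho * math.sin(theta)
        u = math.acos(-x) / math.sqrt(1.0 - x * x)
        return (1.0 + x * u) / (1.0 - x * x)   # dU/dx at x = rho * sin(theta)
    a, b = -math.pi / 2.0, math.asin(r)
    h = (b - a) / steps
    total = du(a) + du(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * du(a + k * h)
    return (1.0 - rho * rho) / math.pi * total * h / 3.0

print(p3_closed(0.2, 0.6), p3_numeric(0.2, 0.6))
```

Working in $\theta$ rather than r avoids the singularity of $1/\sqrt{1-r^2}$ at the ends of the range.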




3. n = 5. Using integration by parts, we find that

(fz'py \[ , y* dr - 1 J 1 - r2)i mdr - — "jr* 

r» , , 

We thus have to find sin 0 J) 2 U d0, and using the formula obtained below in 

J n/2 

Section 7: 

L**"' CJl, *-('♦!)£> 


.1 ( r —!L—U^Udr. 
pi- 


•-p- 


Mp+l -Sr- 2p-l 

- cos 6 ^ h A*-*' -1 V (2« > 2p + 1), 

(• * nv*] • 


we see that its value is 


Hence 


It is convenient to express | y s dr in terms of the corresponding ordinates of 
the frequency curves for smaller samples : we use the substitutions 

Z)U= r ^Vi-,-V 


D*U= 


'(i-p't 1 "' 


— -ys, etc. 


Thus we obtain 

Vl-pVl-r* r 


(1 - p®)* \/l -r*‘ 




Vl - r®(l + p®) ^^cos _1 -r 


4. n = 7. Integrating by parts twice, we have 
4! ir f r 


[>- iW+? J" r(l -,‘)imir 

P p* p*J-iv/l- f *' ' 

We thus have to find $\int_{-\pi/2}^{\theta}(1-2\sin^2\theta)\,D^3U\,d\theta$, and using the formula in Section 6
below with s = 2, p = 0 and 1, we find its value to be







--(»+0<i§k+^W-*+ r ^Vv + * 

+ r*&UcoB0 p{ * — ( ) . 

1 -r 

Substituting as before, we obtain 

r , VWVr-o* (1 — p*) r rVfIr a Vl^(2-p*) 

J.,**- -— v~ t » + v ' s ‘ — * 

_ r(l -r*H4-V+V) 3 + 6^*-^* ^^— ^ co»~ 1 -r 


5. Any odd integer n = 2s + 3.
We shall have

»-8lw l> , /> j!z 


3 1 71- fr tr ™zl rt 

-■ — f- . y„dr =1 (1 - r») 2 £/*>+' u dr = (1 - sin 1 0)* Z>* +1 (/ 

(1 “ P ) * (sin 0 «= r), 

and by the formulae obtained below this is equal to 

— » [K - G)£**^ + ©£' *•'*- ■- +<-!'?} #] 

+K) - + <->£}r^]- 

We can immediately obtain the coefficient of $\left(\theta+\frac{\pi}{2}\right)$ by putting $\theta = \frac{\pi}{2}$, when
$\int_{-1}^{1} y_n\,dr = 1$ and $\cos\theta = 0$. Thus we have, finally,


where 


n = 2s + 3. 


Owing to the complexity of the operator $\Delta$, the above methods of calculating
$\int_{-1}^{r} y_n\,dr$ for n = 3, 5, 7 have been used in preference to the above formula, which
has been included for theoretical interest.

We now proceed to obtain the formulae mentioned above. 

6. Evaluation of the integral $I(\theta) \equiv \int_{-\pi/2}^{\theta}\sin^{2p}\theta\left(\frac{d^{2s-1}U}{dx^{2s-1}}\right)_{x=\rho\sin\theta}d\theta$ (s > p).

This is achieved by expanding $\left(\frac{d^{2s-1}U}{dx^{2s-1}}\right)_{x=\rho\sin\theta}$ in an infinite series,
multiplying each term by $\sin^{2p}\theta$, integrating, and then summing in a particular
way. The infinite series* is


$$\frac{d^{2s-1}}{dx^{2s-1}}\left[\frac{\cos^{-1}(-x)}{\sqrt{1-x^2}}\right] = \frac{\pi}{2}\,1^2.3^2\ldots(2s-1)^2\left[x+(2s+1)^2\frac{x^3}{3!}+(2s+1)^2(2s+3)^2\frac{x^5}{5!}+\ldots\right]$$
$$\qquad + 2^2.4^2\ldots(2s-2)^2\left[1+(2s)^2\frac{x^2}{2!}+(2s)^2(2s+2)^2\frac{x^4}{4!}+\ldots\right],$$

so that we shall be concerned with the integrals

$$\int_{-\pi/2}^{\theta}\sin^{2m}\theta\,d\theta \quad\text{and}\quad \int_{-\pi/2}^{\theta}\sin^{2m+1}\theta\,d\theta,$$

and these we proceed to evaluate. Calling the first $u_m$ and integrating by parts,
we have

$$u_m = \int_{-\pi/2}^{\theta}\sin^{2m-1}\theta\,\sin\theta\,d\theta = -\cos\theta\,\sin^{2m-1}\theta + (2m-1)\int_{-\pi/2}^{\theta}\sin^{2m-2}\theta\,\cos^2\theta\,d\theta$$
$$= -\cos\theta\,\sin^{2m-1}\theta + (2m-1)\,u_{m-1} - (2m-1)\,u_m,$$

whence

$$u_m = -\cos\theta\left[\frac{\sin^{2m-1}\theta}{2m} + \frac{2m-1}{2m(2m-2)}\sin^{2m-3}\theta + \frac{(2m-1)(2m-3)}{2m(2m-2)(2m-4)}\sin^{2m-5}\theta + \ldots + \frac{(2m-1)(2m-3)\ldots 3}{2m(2m-2)\ldots 2}\sin\theta\right]$$
$$\qquad + \frac{(2m-1)(2m-3)\ldots 3.1}{2m(2m-2)\ldots 2}\left(\theta+\frac{\pi}{2}\right).$$

Similarly we obtain the second integral $v_m$:

$$v_m = -\cos\theta\left[\frac{\sin^{2m}\theta}{2m+1} + \frac{2m}{(2m+1)(2m-1)}\sin^{2m-2}\theta + \frac{2m(2m-2)}{(2m+1)(2m-1)(2m-3)}\sin^{2m-4}\theta + \ldots + \frac{2m(2m-2)\ldots 2}{(2m+1)(2m-1)\ldots 3}\right].$$

The power series in $\rho$ is uniformly convergent for all values of $\theta$, so that the
series may be summed after integration. We then have

I(0)mJ i 8in 2 P0^1 2 .3 2 ...(2s-l) 2 (p8in0+^^— p 2 sin 2 0+...j 

+ 2 2 . 4 2 . . .(2s - 2) 2 (l 4 ^ p 2 sin 2 0 + . . .) j d0 

- j i«. 3*. . . (2s - 1 ) 2 [ pVp + P \ +1 + p-V, + . .. J 

+ 2 2 . 4* . . . (2s - 2) 2 [a, + ' ^ p 2 Mp+1 + + 

8) ~.g | 

-l)...3j 


sin^ -2 0+... 

2p(2p-2) ... 2 
' (2p + l)(2p 


* Biometrika, Vol. XI. p. 330.





I (2a i i )»P a [ sin>i,+ ‘fl , 2p + 2__ (2p + 2) (2p ) ... 2 ) 1 

+( +1) 3!(2y + 3 (2p + 3)(2p + l) + + (2p + 3)(2p + 1) ... 3J + 

-co 8 ^2*. 4»... (2*-2)*QS^ + ^^8in^-*(9 + ... 

( g p -^H g p -g)^ ) 

2p(2p — 2)... 2 J 

+ (2s) a £? J sin * P+1 6 + - 2 J?-± 1 sin 2/>— 1 0 + + ( 2 ^ + 1 )< 2 P- 1 )-- 3 sin Ol + 1 

+ W 2!(2p + 2 + (2p + 2)2p * + , " + + 2)(2p) ... 2 8in *'f + -"J 

-i- ( t) a. ,r ^ oa (o 3 l)(2p— 3)...3 , 0 , 2 /j*(2p + l)(2p— 1)...3 1 

+ (^ + 2 j2.4 ...(2s 2) + ( 2 a) 2 , + 2)( ^- __ g +...J . 


The series inside the two sets of brackets [ ] both converge when $\theta = \frac{\pi}{2}$, and
for this value of $\theta$ each series can be expressed as a double series, the terms being
those inside each set of brackets { }. The corresponding double series for any value
of $\theta$ will have its terms less in absolute value than those of the series for $\theta = \frac{\pi}{2}$,
and so converges absolutely and may be summed in any of the standard ways.
We sum by diagonals, viz. we sum the last terms in each of the brackets { }, then
the next to last, and so on. On the cancellation of common factors we have for the
value of the integral:

1 ( 6 )- -“Cos# £{<*ip + a 8 ^ 3 + a6P 6 + ••*) 

+ l 2 {«!/> + (lap 8 + a 6 /5 5 + ...j 


+ 1® .3* 8l ”-^ [a lP + (HP 3 + asp 3 + . . . } 

+ • • • 

sin 2 *' 0 

+ l a . 3* . . . (2p - 1 f ■” {«, p + a,? 8 + asp* + . . . J 


+ l a . 3 s ... (2p + l) a 


sin^ +s 6 

2/T+2! 


[a a p 3 + a s p B + (hP 1 + ■■•] 


a ; n 2p+4 Q 

+ 1* . 3 a . . . (2 p + 3) a 2 -- + 4 --- { asp 6 + 0 ^p> + a,p t +... j 
+•■•] 


- cos 6 


s in# 

IT 


(c/ 0 + «aP a 4* a 4 p 4 .+ 



+ 2 2 {a 0 + a*p 8 ’f a 4 p 4 + ...) 


+ 2* . 4 a { a, + a, p 3 + a* p* + . . . ) 

+ ••• 




t sin 

¥ 


+ 2* . 4* . . . (2p - 2? [a, + a tP ' + a<p* + . . . } 


where 


sin ** 4 * 1 0 

+ 2^4^..(2^ 8 ^ :^T ?{ as ^ + a 4P « + a,p• + ...! 

+ 2* . 4* . . . (2p + 2)« (tt 4 p* +«,,»• + a,p* + ... } 

-•] 

(# + |)t a « + a *P* + ««P 4 + • • •]> 


$$a_r = (r+1)(r+2)(r+3)\ldots(r+2p)\,(r+2p+2)^2(r+2p+4)^2\ldots(r+2s-2)^2.$$

Now $\Delta$ is used to denote the operator $\frac{1}{\rho}\frac{\partial}{\partial\rho}\,\rho\,\frac{\partial}{\partial\rho}$, so that

$$\Delta^p\rho^n = n^2.(n-2)^2.(n-4)^2\ldots(n-2p+2)^2\,\rho^{n-2p} \quad (n > 2p).$$

Thus it is clear that 


and so 


a^^A*-*- 1 ^*-* 

9** 

«ip + «ap 8 + asp 5 + — “jyi, A *'*" 1 [p 8 *- 1 + p ** 4 - 1 + p 2 * 4 " 8 + ...]; 


inside the brackets [ ] we may add any number of the terms previous to p*” 1 since 
they will all be annihilated by the operator. 


Thus 

a l p + a a p* + a s p i + . 

•' 3p* p U-vJ 

(0 < m ^ s), 

similarly 

cto + o»p* + a.p 4 + . . 

ap* li-p*J 

S 

(0 < m ^ *). 


Substituting these values in $I(\theta)$, choosing in each case a suitable value of m,
we have

♦I-* 


— cos ( 





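The sum of the two series inside the brackets { } is

so that we have, finally,

$$I(\theta) = \left(\theta+\frac{\pi}{2}\right)\frac{\partial^{2p}}{\partial\rho^{2p}}\,\Delta^{s-p-1}\,\frac{\rho^{2s-2p-2}}{1-\rho^2} - \cos\theta\,\frac{\partial^{2p}}{\partial\rho^{2p}}\,\Delta^{s-p-1}\left\{\frac{\rho^{2s-2p-1}}{1-\rho^2}\,\frac{\cos^{-1}(-\rho\sin\theta)}{\sqrt{1-\rho^2\sin^2\theta}}\right\}.$$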

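7. Evaluation of the integral

$$\int_{-\pi/2}^{\theta}\sin^{2p+1}\theta\,(D^{2s}U)_{\rho\sin\theta}\,d\theta \quad (2s > 2p+1).$$

By an exactly similar method we can prove that the value of this integral is

$$\left(\theta+\frac{\pi}{2}\right)\frac{\partial^{2p+1}}{\partial\rho^{2p+1}}\,\Delta^{s-p-1}\,\frac{\rho^{2s-2p-2}}{1-\rho^2} - \cos\theta\,\frac{\partial^{2p+1}}{\partial\rho^{2p+1}}\,\Delta^{s-p-1}\left\{\frac{\rho^{2s-2p-1}}{1-\rho^2}\,\frac{\cos^{-1}(-\rho\sin\theta)}{\sqrt{1-\rho^2\sin^2\theta}}\right\}.$$
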


8. Even values of n.

The term $(1-r^2)^{\frac{n-4}{2}}$ is now rational, so that the substitution $r = \sin\theta$ is not
used. The integration proceeds either by expansion of this term and repeated
integration by parts, or by the method given below, which has the advantage of
giving the integral in terms of the previous ordinates.

For completeness we include the easily solved case of n = 4. We have

$$\int_{-1}^{r} y_4\,dr = \frac{(1-\rho^2)^{3/2}}{\pi}\int_{-1}^{r} D^2U\,dr = \frac{(1-\rho^2)^{3/2}}{\pi\rho}\Big[DU\Big]_{-1}^{r}.$$

Now we know that

$$DU = \frac{1}{1-r^2\rho^2}\,[1 + r\rho\,U], \quad\text{and}\quad DU = \frac{\pi\sqrt{1-r^2}}{1-\rho^2}\,y_3,$$

so that

$$\int_{-1}^{r} y_4\,dr = \frac{\sqrt{1-\rho^2}\,\sqrt{1-r^2}}{\rho}\,y_3 - \frac{\sqrt{1-\rho^2}}{\pi\rho} + \frac{\cos^{-1}\rho}{\pi}.$$


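9. n = 6.

Assume that $\int_{-1}^{r} y_6\,dr = A + BU + F_1\,DU + \ldots + F_p\,D^pU$,
where A, B, $F_1 \ldots F_p$ are functions of r. Differentiating with respect to r, we see
that

$$y_6 = A' + B'U + (F_1' + \rho B)\,DU + (F_2' + \rho F_1)\,D^2U + \ldots + (F_p' + \rho F_{p-1})\,D^pU + \rho F_p\,D^{p+1}U,$$

so that p + 1 = 4, i.e. p = 3, and

$$A' = 0,\quad B' = 0,\quad F_1' + \rho B = 0,\quad F_2' + \rho F_1 = 0,\quad F_3' + \rho F_2 = 0,\quad \rho F_3 = \frac{(1-\rho^2)^{5/2}(1-r^2)}{3!\,\pi}.$$

Thus

$$F_3 = \frac{(1-\rho^2)^{5/2}(1-r^2)}{6\pi\rho}, \qquad F_3\,D^3U = \frac{\sqrt{1-\rho^2}\,\sqrt{1-r^2}}{3\rho}\,y_5,$$
$$F_2 = \frac{(1-\rho^2)^{5/2}\,r}{3\pi\rho^2}, \qquad F_2\,D^2U = \frac{(1-\rho^2)\,r}{3\rho^2}\,y_4,$$
$$F_1 = -\frac{(1-\rho^2)^{5/2}}{3\pi\rho^3}, \qquad F_1\,DU = -\frac{(1-\rho^2)^{3/2}\sqrt{1-r^2}}{3\rho^3}\,y_3,$$

B = 0 and A = const., and to find A let $r \to -1$: we have

$$DU = \frac{1}{1-r^2\rho^2}\,[1 + r\rho\,U] \to \frac{1}{1-\rho^2} - \frac{\rho\cos^{-1}\rho}{(1-\rho^2)^{3/2}},$$

and

$$D^2U = \frac{1}{1-r^2\rho^2}\,[U + 3r\rho\,DU] \to \frac{1}{(1-\rho^2)^{5/2}}\left[(1+2\rho^2)\cos^{-1}\rho - 3\rho\sqrt{1-\rho^2}\right].$$

Thus

$$A = \frac{\sqrt{1-\rho^2}\,(1-4\rho^2)}{3\pi\rho^3} + \frac{\cos^{-1}\rho}{\pi},$$

and

$$\int_{-1}^{r} y_6\,dr = \frac{\sqrt{1-\rho^2}\,(1-4\rho^2)}{3\pi\rho^3} + \frac{\cos^{-1}\rho}{\pi} - \frac{(1-\rho^2)^{3/2}\sqrt{1-r^2}}{3\rho^3}\,y_3 + \frac{(1-\rho^2)\,r}{3\rho^2}\,y_4 + \frac{\sqrt{1-\rho^2}\,\sqrt{1-r^2}}{3\rho}\,y_5.$$
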

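10. n = 8.

Using the same method, we obtain

$$\int_{-1}^{r} y_8\,dr = -\frac{\sqrt{1-\rho^2}\,(3-11\rho^2+23\rho^4)}{15\pi\rho^5} + \frac{\cos^{-1}\rho}{\pi} + \frac{(1-\rho^2)^{5/2}\sqrt{1-r^2}}{5\rho^5}\,y_3 - \frac{r\,(1-\rho^2)^2}{5\rho^4}\,y_4$$
$$\qquad + \frac{(3r^2-1)(1-\rho^2)^{3/2}(1-r^2)^{-1/2}}{15\rho^3}\,y_5 + \frac{(1-\rho^2)\,r}{5\rho^2}\,y_6 + \frac{\sqrt{1-\rho^2}\,\sqrt{1-r^2}}{5\rho}\,y_7.$$
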


11. Proceeding in the like manner we can obtain the probability integral for
larger samples, but the formulae then become too complicated for practical use.
Miss F. N. David, who is computing the Probability Integral Table for the
distribution of r in small samples, is using the above formulae for the smallest samples,
but proceeds in a different manner for the larger sample sizes.
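
The closed forms for even n can be checked against direct quadrature of the ordinate. The sketch below is an illustration only: it assumes the expressions for n = 4, 6 and 8 as read here, uses the recurrence $(1-x^2)\,U^{(k+1)} = (2k+1)\,x\,U^{(k)} + k^2\,U^{(k-1)}$ (obtained by differentiating $(1-x^2)\,DU = 1 + xU$) to supply the derivatives of U, and the function names are hypothetical.

```python
import math

def u_derivs(x, kmax):
    """Return [U, U', ..., U^(kmax)] at x, with U(x) = acos(-x)/sqrt(1 - x^2)."""
    d = [math.acos(-x) / math.sqrt(1.0 - x * x)]
    d.append((1.0 + x * d[0]) / (1.0 - x * x))
    for k in range(1, kmax):
        # Recurrence: (1 - x^2) U^(k+1) = (2k + 1) x U^(k) + k^2 U^(k-1)
        d.append(((2 * k + 1) * x * d[k] + k * k * d[k - 1]) / (1.0 - x * x))
    return d

def y(n, r, rho):
    """Ordinate y_n(r) of the distribution of r for population correlation rho."""
    coef = (1.0 - rho * rho) ** ((n - 1) / 2.0) / (math.pi * math.factorial(n - 3))
    return coef * (1.0 - r * r) ** ((n - 4) / 2.0) * u_derivs(rho * r, n - 2)[n - 2]

def simpson(f, a, b, steps=2000):
    """Composite Simpson's rule with an even number of steps."""
    h = (b - a) / steps
    total = f(a) + f(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0

def closed_form(n, r, rho):
    """Integral of y_n from -1 to r in terms of the previous ordinates."""
    q, s = math.sqrt(1.0 - rho * rho), math.sqrt(1.0 - r * r)
    tail = math.acos(rho) / math.pi
    if n == 4:
        return q * s / rho * y(3, r, rho) - q / (math.pi * rho) + tail
    if n == 6:
        return (q * (1 - 4 * rho**2) / (3 * math.pi * rho**3) + tail
                - q**3 * s / (3 * rho**3) * y(3, r, rho)
                + q**2 * r / (3 * rho**2) * y(4, r, rho)
                + q * s / (3 * rho) * y(5, r, rho))
    if n == 8:
        return (-q * (3 - 11 * rho**2 + 23 * rho**4) / (15 * math.pi * rho**5) + tail
                + q**5 * s / (5 * rho**5) * y(3, r, rho)
                - r * q**4 / (5 * rho**4) * y(4, r, rho)
                + (3 * r * r - 1) * q**3 / (15 * rho**3 * s) * y(5, r, rho)
                + q**2 * r / (5 * rho**2) * y(6, r, rho)
                + q * s / (5 * rho) * y(7, r, rho))
    raise ValueError("closed form given here only for n = 4, 6, 8")

for n in (4, 6, 8):
    numeric = simpson(lambda t: y(n, t, 0.5), -1.0, 0.3)
    print(n, closed_form(n, 0.3, 0.5), numeric)
```

The closed forms divide by powers of $\rho$, so the check is run away from $\rho = 0$.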



Erratum


On p. 78, the first term in the equation for J* y a dr should be 





A FURTHER NOTE ON THE RELATION BETWEEN THE 
MEDIAN AND THE QUARTILES IN SMALL SAMPLES 
FROM A NORMAL POPULATION. 

By TOKISHIGE HOJO. 

1. Introductory . 

In a previous paper* I have considered certain properties of the median and 
quartiles in small samples from a normal population, and have investigated methods 
of linking these up with the limiting values in large samples. The latter problem 
has since been further investigated by K. Pearson†. If the population mean and
standard deviation are unknown, the most satisfactory estimates of these, when 
sampling from a normal population, are obtained from the sample mean and 
standard deviation. But a variety of alternative estimates may be calculated from 
the median and quartiles, whose relative value may be judged by a comparison of 
their standard errors. In the earlier paper the following comparisons were made: 

(1) Standard error of mean with that of median. (Tables IIa and IIb, loc. cit.)

(2) Standard error of an estimate of a obtained from the sample standard 
deviation with that of another estimate obtained from the interquartile distance. 
(Table VIII, loc. cit.) 

In the present paper I shall add to these previous results : 

(3) Standard error of the mid-point between the two quartiles, i.e. of $\frac{1}{2}(q_1 + q_2)$.

(4) Standard error of an estimate of $\sigma$ obtained from the distance between a
quartile and the median, i.e. from $q_1 - m$ or $m - q_2$.

The first result follows from calculations previously carried out; for the second
it is necessary to determine the coefficient of correlation $r_{mq}$ between quartiles
and median.

Certain experimental sampling results will also be referred to. 

2. The Standard Error of the Mid-quartile Point, $\frac{1}{2}(q_1 + q_2)$.

In the limiting case of large samples, the standard error of the point mid-way
between the two quartiles has been given by K. Pearson as

$$\sigma_{\frac{1}{2}(q_1+q_2)} = 1.1126\,\sigma/\sqrt{n} \qquad (1).$$

It is easily seen that

$$\sigma_{\frac{1}{2}(q_1+q_2)} = \sigma_q\sqrt{\tfrac{1}{2}\,(1 + r_{q_1q_2})} \qquad (2),$$

from which the ratio of $\sigma_{\frac{1}{2}(q_1+q_2)}$ to the standard error of the mean, $\sigma/\sqrt{n}$, may be
readily calculated from the results given in Tables IVb and VIb of my earlier paper.
Some comparative results are shown in Table I below, the figures in the last
column being taken from my previous Tables IIa and IIb.

* Biometrika, Vol. XXIII. pp. 315-360. † Biometrika, Vol. XXIII. pp. 361-397.





TABLE I.

  n       σ_q √n/σ     r_{q1 q2}    σ_{½(q1+q2)} √n/σ    σ_m √n/σ
  4       1.1590       0.4888       1.000                1.092
  7       1.3406       0.3116       1.086                1.214
 10       1.3229       0.3717       1.096                1.177
 12       1.2740       0.3895       1.062                1.190
 22*      1.3419       0.3463       1.101                1.217
  ∞       1.3626       0.3333       1.113                1.253


Owing to the four different forms of quartile definition, corresponding to the
cases n = 4p, 4p + 1, 4p + 2 and 4p + 3, the tabled standard errors do not change
uniformly as n increases, but it is seen that in all cases the standard error of
$\frac{1}{2}(q_1 + q_2)$ is less than that of m. In other words a more reliable estimate of the
population mean is obtained from the mid-quartile point than from the median,
though both estimates have a larger standard error than the sample mean†.
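
The relation between the columns of Table I can be verified directly. In the sketch below the standard error of the mid-quartile point is recomputed from the quartile standard error and the correlation between the two quartiles, taking the relation to be $\sigma_q\sqrt{\frac{1}{2}(1+r_{q_1q_2})}$; the first-column entries for n = 4 and n = 7 are read as 1.1590 and 1.3406, the values implied by the quartile standard errors of Table II, and the names used are illustrative.

```python
import math

# (n, sigma_q * sqrt(n) / sigma, correlation of the two quartiles,
#  tabled value of sigma_{(q1+q2)/2} * sqrt(n) / sigma)
ROWS = [
    (4, 1.1590, 0.4888, 1.000),
    (7, 1.3406, 0.3116, 1.086),
    (10, 1.3229, 0.3717, 1.096),
    (12, 1.2740, 0.3895, 1.062),
    (22, 1.3419, 0.3463, 1.101),
]

def mid_quartile_ratio(sigma_q_ratio, r_q1q2):
    """Standard error of the mid-quartile point in units of sigma / sqrt(n)."""
    return sigma_q_ratio * math.sqrt(0.5 * (1.0 + r_q1q2))

for n, sq, r, tabled in ROWS:
    print(n, round(mid_quartile_ratio(sq, r), 4), tabled)
```

For n = 4 the mid-quartile point coincides with the sample mean, which is why the ratio is unity.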


3. The Correlation between the Quartile and Median Points.

The chance that the qth individual in a ranked sample of n lies in the interval
$x \pm \frac{1}{2}dx$ is given by

<3) ’ 

where

$$\alpha_x = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{x} e^{-\frac{1}{2}x^2}\,dx \qquad (4).$$

This is the fundamental relation used in the following analysis.


Since the mean value of m, the sample median, is at the origin, the numerator
of the correlation coefficient, or

$$P_{mq} = r_{mq}\,\sigma_m\,\sigma_q \qquad (5),$$

will be the mean product of (m × q), and it is necessary to consider separately the
four cases previously defined.


(a) Case n = 4m + 3‡.

Here both median and quartile correspond to values of observations, and we 
are concerned with the mean value of a product ; in fact 

* (2m +1) ! m ! a j_oo j-oo^ + ^ ~ ^ ~ 


* The figures 1.3419 and 0.3463 are due to K. Pearson’s calculations, loc. cit. pp. 380 and 390; the
figure 1.217 is obtained from my equation (b), p. 328, loc. cit.

† For the limiting values of other forms of estimate of the mean from ranked individuals, see
K. Pearson, Biometrika, Vol. XIII. pp. 118-128.

‡ This m is an integer, and must be distinguished from the m used as a subscript which denotes
the median.






n! 




jm—r 

Tl 


where 

and 


" (2m + 1) ! m I* J _ J (1 ~ a *' rWl *** r ? 0 1 )r 
* {[-"«' ""«]!. 

X r« r - r ^^r «?*****•* 

J —00 J —00 

“ 2* (2i» + 1 ) ! m ! 1 A 1 wCf * r -0 1 ^ "* Cr ^ 2m + r + 1 ^ 

71 T m ( m 

” 2tt(2to+ l)!w ! 9 Jt o 1 ^* ”*°* | Tam+ ‘ r ~o (“ 1 ^ ( 2m + r + 1 ) 

m 

+ (- 1 ) r mPr ( 2w + r + 1) (TO + S - r) m+s-r-Jlm+r ' for TO £ 1 . . .(6), 

f+oo ^-ias* ; 

r '-L<s* (n 

» A r = ^.|_ oo «^e~ x,, <*BiJ l ^a r Xi e- r >'da; 2 (8). 


r =»0 


(b) Case n = 4m + 2.

Here $P_{mq}$ is the mean value of the product $\{\frac{1}{2}(x_3 + x_2) \times x_1\}$, since the median
(as defined) lies midway between the two central observations.

j> ~ - (*odT. i r.L fy~ 1 1 - ■ «■.>" 

x ) a ‘i da *. da *.dax, 

* 2(2to)!to!(to --T)j: f> “ ax - )m " 1 

xfL^f-1" + 2 to r a* m 1 ^ <fo s + j «,i 
11 ** V 27 rJ-« J- oo * 2 tt 3 2 w + l 2 J 

xLHr'C** 

ar • a «m+r+l . 

+ *~£*» ^ *» + £ + - r *.*.} (»). 




in which 


a?i* 

+ “ + ‘ 2 ;- r ' , n«r"-«-^.L«rv-v, ! 


- (m + « — r - ]) w + <r . y- 2 ^ 2 m -fr + 


f+oo 

I „»* 


.‘Vm+« - 1 e 


f+°° /*» f** , *i a +;e,s+2;ra* 

I C <C •!«""' * 

J —CO J — 00 ^ — 00 

[+«• [°° 2 , _(*i+*.)«+2x.« Cf fe+*,+a)<-|oo 

m ‘l-Jo a '*+» a * e 2 dtC3da,i \ [- a *t:z*. e 2 J 0 

V 2 w Jo *<+*•+*> ar *\ 

= j_ a ,®*" - * e ~ r,l(ia: i ( v ^ 7r/ »»+*-i “ f^ a T ¥a ' 1 e-^dxj 

M- 0° roo (JH+X»)2 + 2.ri* 

+ ( w + . s _ r _])j e ~ 2 ^ 3 ^i 

v // f Xl+X > +< -r -2 e"** , \ 

X l Wr |-. “* Wn ) 

f -f-oo 

= 2tT (/^m— lAn+s— 1 “ 2m— 1 1 m-fs— l) (wi + 5 — ?’ — 1) I 0 ?™ 1 e~~ x * 

J —00 1 


_ " V 27 T , +1 r*‘+** m+ „_ r _ 2 e -x ' ]® (» V 2 tt m4 .„_ 1 , ) 

— 9 i r W + «- r-ir 

~ ~ 7r |wH *— 1 * 2m— 1 + ~ An,+»— r~ 2 (-* 2m~l ~ ^2w-fr) 

~ ^»*+*-r-“2^2w-l + 2m+r^w+«-r- 2 +" An-fs-1^ 2m- 1 ~ 2m-l^m-f*-l | 

~27T j w +*-i/ 2m _i + — ( m+s _ 1 / 2m _i “ m . M _ r _ 2 /2m+r)| *11). 

f + P** 1 /v w + «~r-12m+r+l ®i*+a** , f+°° m .j a _ r i •£»* 

J -ooJ-co ** *** ***** dXtdo'l = j 2 ^*! 


. mfa— r 1 


X\ e 2 j 


_ a 2m4 r+1 2wi + r j- 1 f** 




ry 3 » t +*c- a: > , ] + ® f-f” 3 m + « 3m+ . , , 

" 2j-«~L«,2V%r e *'• *i+(2»+r + l) 


•«r— 1 .-*r <rs?**r 

J —oo V 2 tT J —a 





f + * ™±±Z*-~- 1 - a™*'-'-* e-*>'dx 1 r 

J-» V2tt * M-co 1 


V2' 


dtr j 


3 m + .<? , 




9 ‘ 7 , 3m+«-l + (2m + r+l){(»M-«-r-l) m+<l _ r _ 2 /' 2 „ l+r + ysm+«-l] 

( 12 ). 

On substituting (10), (11) and (12) into (9) we obtain, on collecting terms, 

7i! S 

(2m)! m! (m- 1)! 47t 8 «o 

(2m — r) (2m + r + l)(m-fs-r-l) 


p -- r "v 1 1 nr r \ 2m ( m + a ) , 

l ' m8 r ^ 0 V O m-lWj r “+l 


3m 4- s - 2r 


(2m 4 l)(r 4- 1) 


/ TO — 

' m+a-r- 2* 2 w+r + g ** 3ro+* 


-} 

for m £ 1 (13). 

(c) Case n = 4m + 1.

Here the median corresponds to an observation value, but the quartile point
lies midway between two observations. $P_{mq}$ is therefore the mean value of the
product $\{x_3 \times \frac{1}{2}(x_2 + x_1)\}$, or

P = H * [ + f* 1 [** a 2w (a -a yn-in-a y»-i 

mq (2m)! {(to - l)!| a J_J_J _«,“*» ( *• x>) (1 * l) 

x .r 3 J (,r 2 + x x ) da Xt da Xt da Xl (14). 

Following a similar process of development to that used in the previous cases, 
it may be shown finally that 


p _ n • V/_ 1 , (* m v*/— i \r n _ 2 m 4 r 

~ (2m)! {(m - 1)!} 2 47 t 8 «o ^ J %V ' ^ r (m 1 r)<« + 1) 

x {(s 4 7 a -r)( s -m4r4l) m H a-r-i ^ 27/i+r-i 

+ (m - r) (m — r — 1 ) m _ f _ 2 / 2m+r -i - * (* f 1 ) a-i^-i 

+ (m 4- r) (T 3 m-2 - ^Sm+a-i )} for m ^ 2 (15). 

(d) Case n = 4m.

Here both median and quartile fall midway between observations, and $P_{mq}$ is
the mean value of the product $\{\frac{1}{2}(x_4 + x_3) \times \frac{1}{2}(x_2 + x_1)\}$, or

7 i! f +Q0 f Xl i Xt i X% Q™ 1 

PnQ = (2m -1)1 (m - 2) ! (to-1)! J _<J _J -J* Xl ^ x, ~ a ^ )m ~ 2 0 " 

xl(^ + ^3) J (®s + x l )da Xt da Xt da Xt da Xl (16), 

and it can be shown after a rather lengthy process of reduction that we obtain, as 
a final result, 

w | m-l w— 2 

PmQ " (2 m-l)l(m- 2 )](m-l)Tl 6 imr ,? 0 m ~ lCa “ 0 ( ~ 1 X»»-a C 'r 

(m 4- 8 - r — 1 ) {s — m 4' r 42) ( 2m 4 ?•) (r - 2m 4- 1) 

(r 4- 1)($4 l)(m- 7 *— 1) 


m+a—r— 2^2m4r- 1 




(m — r- 2)(2m + r)(r- 2m + 1) , , (3»» — 1)» r 

+ ( 7 + 17 ( 7 + 1 ) m-r-S' tm+r-l + m _ r _ j 

2w(2m- l)(m — 1) r 

+ "(rTI)(«+Tr^ 


2m (2m — l)(m + *){1 1 \ T 

+ — -j - m+a—l‘tm—i + 

3 m - 5 — 2r — 3 ) 

+ ~'T(7+T) • /3 »+*-*{ 


2r- 3w + 2 ^ 

2(» + l) Sm “ 8 

for 2 (17). 


From the preceding equations the values of $P_{mq}$ may be calculated for
n = 1, 2, ... 12, using the numerical values of the integral forms $T_r$, $I_r$ and ${}_pI_r$
previously computed*. The values of $\sigma_m$ and $\sigma_q$ required to find $r_{mq}$ have already
been calculated. The results are collected in Table II.


TABLE II.

Size of
sample, n    P_mq       σ_m        σ_q        r_mq
  1          1.00000    1.00000    1.00000    1.0000
  2          0.50000    0.70711    0.82565    0.8564
  3          0.27566    0.66983    0.74798    0.5502
  4          0.25000    0.54608    0.57952    0.7900
  5          0.17829    0.53557    0.54948    0.6058
  6          0.16432    0.46340    0.52876    0.6706
  7          0.13074    0.45874    0.50669    0.5625
  8          0.12364    0.41011    0.43777    0.6881
  9          0.10364    0.40755    0.42436    0.5990
 10          0.09842    0.37214    0.41835    0.6322
 11          0.08590    0.37044    0.40708    0.5696
 12          0.08269    0.34346    0.36777    0.6539



4. Limiting values for $r_{mq}$.

We may use again the method given by K. Pearson†; four cases will arise as
before.

(a) Case n = 4m + 3.

Here the median and quartile points correspond to observations, and the result
has been given in the paper just mentioned, i.e.

$$r_{mq} = 0.5774 \qquad (18).$$

(b) Case n = 4m + 2.

Here the median and quartile are of form $\frac{1}{2}(x_3 + x_2)$ and $x_1$ respectively, and

$$P_{mq} = \tfrac{1}{2}\,\{\text{Mean}\,(x_3x_1) + \text{Mean}\,(x_2x_1)\}$$


J— ?» / 

i_*jU 1*. *J>(i 

_!A] 

n \ 

n J nz\Zk ft \ 

«/j 


* See Table I, pp. 325-326, loc. cit.

† Biometrika, Vol. XIII. pp. 115-117.






where 


fs 1,1 s, _ 1 1 3 

n m 2 + 2n’ n~2~2n,' 7“ 4’ 


and $z_h$ and $z_k$ are the tabled ordinates of the normal probability function,
corresponding to the proportional areas

1 /. 1\ 1 . 3 


|(l+«A) = |(l + ~). |(1 +o*)-|. 


It follows that p L (19). 

»n z h z k 

Further, from equations (1'2) and (15) of my previous paper, we know that 

•’’-TnkJ'-l « 

m 

Values of may be calculated from (19), (20) and (21). For example 
I a h = 02, z h = •3988169, z k = -3177766, 


V n2z h 


if n = 50 =-986313 


?> = 124111 ij*' 


■ 1-36263 


. n«7= -5 832, 


while if 


n = 102, r v 


(c) Case n = 4m + 1.

The median and quartile are now of form $x_3$ and $\frac{1}{2}(x_2 + x_1)$ respectively, and it
is found that

« 

where z^ and z^ are the ordinates corresponding to 

1/t . v 3 1 1,, , ,3 1 

a (1 +“**) = 4 + a * a(l+ a fc,)- 4i a_- 


Further, 


' 1*25381 ^ (23). 


f 1 

_ 4 _ 

4 N 

, , 1 i 

(, 4 

4 \ 

2 /, 

8 

4 \) 

k*( 

Zn 

' 3n s J 

l+ ^' 

l 1 + 3i»' 

" 3n*J 

1 + — ~(l 

"3^ + 

3n*)| 


71 — 49, f'mq ! 
n = 101, 


[* The limiting value of $\sigma_q$ given in Equation (18.1), Biometrika, Vol. XXIII. p. 338 (Professor Hojo’s
previous paper) appears to be in error. I do not follow the values given for $\alpha_1$, $\alpha_2$, $\beta_1$ and $\beta_2$. On
p. 346 the correct values are given for the $\alpha$'s, but the values of the $\beta$'s, differing from those on p. 338,
appear to be in error. The limiting values for n = ∞ would not, however, be affected. Ed.]



86 


Median and Quartiles in Small Samples 


(d) Case n = 4m.

The median and quartile are of form + and £(# 2 + #i) respectively 
and again as in (22) it is found that 


mQ 1 Qm h (< 


.fi( 1 .?) + i( 1+ ? 


.(25), 


where $z_{h'}$ and $z_{k'}$ are as before, while $\sigma_m$ and $\sigma_q$ are found from (20) and (24).


If $n = 52$, $r_{mq} = .5903$.

$n = 100$, $r_{mq} = .5841$.

$n = 500$, $r_{mq} = .5787$.

We see that as $n$ increases, the values of $r_{mq}$ in cases (b), (c) and (d) tend to
the limiting value of case (a), viz. $r_{mq} = .5774$.
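
The limiting values quoted in this paper (1.2533, 1.3626 and .5774) can be cross-checked against the usual large-sample results for quantiles of a normal sample. The sketch below is not the method followed in the paper; it assumes the standard asymptotic variance $p(1-p)/\{n\,\phi(z_p)^2\}$ of a sample quantile and the standard asymptotic correlation $\sqrt{p_1(1-p_2)/\{p_2(1-p_1)\}}$ between two quantiles with $p_1 < p_2$. The function name `limiting_values` is illustrative only.

```python
import math
from statistics import NormalDist


def limiting_values(p_median=0.5, p_quartile=0.75):
    """Large-sample values for the median and upper quartile of a normal sample.

    Returns (sigma_m, sigma_q, r_mq), where the two standard errors are
    expressed in units of sigma / sqrt(n).
    """
    unit_normal = NormalDist()

    def quantile_coefficient(p):
        # Asymptotic s.e. of the p-quantile: sqrt(p(1-p)) / ordinate at that quantile.
        z = unit_normal.inv_cdf(p)
        return math.sqrt(p * (1.0 - p)) / unit_normal.pdf(z)

    sigma_m = quantile_coefficient(p_median)
    sigma_q = quantile_coefficient(p_quartile)

    # Asymptotic correlation between two sample quantiles, p1 < p2.
    p1, p2 = sorted((p_median, p_quartile))
    r_mq = math.sqrt(p1 * (1.0 - p2) / (p2 * (1.0 - p1)))
    return sigma_m, sigma_q, r_mq


sigma_m, sigma_q, r_mq = limiting_values()
print(f"{sigma_m:.4f} {sigma_q:.4f} {r_mq:.4f}")
```

Under these assumptions the correlation reduces to $1/\sqrt{3}$, which agrees with the limiting value of case (a).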


5. Empirical formulae to bridge the Gap between limiting and small Sample 
Values. 

In the previous paper the following formulae were given to provide values for
$\sigma_m$ between the computed series ($n \le 12$) and the limiting value:

$$\frac{\sigma_m}{\sigma/\sqrt{n}} = \begin{cases} 1.2533 - .2653/n - .0699/n^2 + .0822/n^3 & \text{for } n \text{ odd} \\ 1.2533 - .8261/n + .7826/n^2 - .3478/n^3 + .1304/n^4 & \text{for } n \text{ even} \end{cases} \qquad (26).$$


These “best-fitting” asymptotic curves were obtained using the method of
moments. Similar equations have been calculated for both $\sigma_q$ and $r_{mq}$, but as here
there were only three computed points for each of the four cases, the curves
actually pass through these points. The equations are as follows:

$$\frac{\sigma_q}{\sigma/\sqrt{n}} = \begin{cases} 1.3626 - 1.2115/n + 1.8693/n^2 - 1.1261/n^3 & \text{for } n = 4m \\ 1.3626 - 1.0046/n + 1.9330/n^2 - 1.2910/n^3 & \text{for } n = 4m + 1 \\ 1.3626 - .3800/n - .2110/n^2 + .3819/n^3 & \text{for } n = 4m + 2 \\ 1.3626 - .1055/n - .3805/n^2 + .2789/n^3 & \text{for } n = 4m + 3 \end{cases} \qquad (27).$$

$$r_{mq} = \begin{cases} .5774 + 1.0150/n - 1.4064/n^2 + 2.9952/n^3 & \text{for } n = 4m \\ .5774 + .2842/n - .9215/n^2 + 1.0600/n^3 & \text{for } n = 4m + 1 \\ .5774 + .5250/n + .2778/n^2 - .4230/n^3 & \text{for } n = 4m + 2 \\ .5774 - .0269/n - .8210/n^2 + 1.9722/n^3 & \text{for } n = 4m + 3 \end{cases} \qquad (28).$$


A combination of results is given in Table III, consisting of (a) correct
computed values, (b) limiting values obtained by the method of section (4) above,
(c) results calculated from equations (26), (27) and (28).
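
The empirical formulae (26), (27) and (28) are simple polynomials in $1/n$, and may be evaluated as in the sketch below. The coefficients are those read from the equations of this section; the function and table names are illustrative, and the checks printed at the end use entries of Table III for n = 13, 14 and 15.

```python
# Coefficients (c0, c1, c2, c3) of c0 + c1/n + c2/n^2 + c3/n^3, keyed by n mod 4.
SIGMA_Q_COEFFS = {
    0: (1.3626, -1.2115, 1.8693, -1.1261),   # n = 4m
    1: (1.3626, -1.0046, 1.9330, -1.2910),   # n = 4m + 1
    2: (1.3626, -0.3800, -0.2110, 0.3819),   # n = 4m + 2
    3: (1.3626, -0.1055, -0.3805, 0.2789),   # n = 4m + 3
}
R_MQ_COEFFS = {
    0: (0.5774, 1.0150, -1.4064, 2.9952),
    1: (0.5774, 0.2842, -0.9215, 1.0600),
    2: (0.5774, 0.5250, 0.2778, -0.4230),
    3: (0.5774, -0.0269, -0.8210, 1.9722),
}


def _poly_in_reciprocal(coeffs, n):
    """Evaluate c0 + c1/n + c2/n^2 + ... for the given coefficients."""
    return sum(c / n ** power for power, c in enumerate(coeffs))


def sigma_m_ratio(n):
    """Equation (26): s.e. of the median in units of sigma/sqrt(n)."""
    if n % 2 == 1:
        return _poly_in_reciprocal((1.2533, -0.2653, -0.0699, 0.0822), n)
    return _poly_in_reciprocal((1.2533, -0.8261, 0.7826, -0.3478, 0.1304), n)


def sigma_q_ratio(n):
    """Equation (27): s.e. of the quartile in units of sigma/sqrt(n)."""
    return _poly_in_reciprocal(SIGMA_Q_COEFFS[n % 4], n)


def r_mq(n):
    """Equation (28): correlation between median and quartile."""
    return _poly_in_reciprocal(R_MQ_COEFFS[n % 4], n)


for n in (13, 14, 15):
    print(n, round(sigma_m_ratio(n), 4), round(sigma_q_ratio(n), 4), round(r_mq(n), 4))
```

These formulae are intended only for interpolation between n = 12 and the limiting values; for n of 12 or less the exact values of Table III should be preferred.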


The column dealing with $\sigma_{q-m}$ is referred to in the following section.





TABLE III.

(Cases: A, n = 4m + 1; B, n = 4m + 2; C, n = 4m + 3; D, n = 4m.)

Columns: (1) Case; (2) Size of sample, $n$; (3) $\sigma_m/(\sigma/\sqrt{n})$; (4) $\sigma_q/(\sigma/\sqrt{n})$; (5) $r_{mq}$; (6) $\sigma_{q-m}/(\sigma/\sqrt{2n})$.

(1)    (2)       (3)         (4)         (5)        (6)

A        1      1.0000      1.0000      1.0000     0.0000
B        2      1.0000      1.1676       .8564     0.8525
C        3      1.1602      1.2955       .5502     1.6556
D        4      1.0922      1.1590       .7900     1.0355
A        5      1.1976      1.2287       .6058     1.5237
B        6      1.1351      1.2952       .6706     1.4100
C        7      1.2137      1.3406       .5625     1.6970
D        8      1.1600      1.2382       .6881     1.3431
A        9      1.2226      1.2731       .5990     1.5816
B       10      1.1768      1.3229       .6322     1.5275
C       11      1.2286      1.3501       .5696     1.6986
D       12      1.1898      1.2740       .6539     1.4536
A       13      1.2325      1.2962       .5942     1.6129
B       14      1.1982      1.3345       .6161     1.5788
C       15      1.2353      1.3540       .5725     1.6995
D       16      1.2047      1.2939       .6360     1.5118
A       17      1.2375      1.3100       .5911     1.6316
B       18      1.2098      1.3409       .6073     1.6070
C       19      1.2392      1.3561       .5739     1.7005
D       20      1.2139      1.3066       .6250     1.5480
A       25      1.2426      1.3255       .5873     1.6531
B       30      1.2266      1.3497       .5951     1.6467
C       35      1.2457      1.3593       .5760     1.7022
D       40      1.2331      1.3335       .6019     1.6244
A       45      1.2474      1.3412       .5832     1.6754
A      (49)    (1.2533)    (1.3449)     (.5850)   (1.6777)
B       50      1.2371      1.3549       .5880     1.6703
B      (50)    (1.2411)    (1.3626)     (.5832)   (1.6879)
D      (52)    (1.2416)    (1.3459)     (.5903)   (1.6614)
C       55      1.2485      1.3606       .5766     1.7035
D       60      1.2398      1.3430       .5939     1.6511
C       99      1.2506      1.3615       .5770     1.7046
C      (99)    (1.2533)    (1.3626)     (.5774)   (1.7062)
D      100      1.2451      1.3507       .5874     1.6727
D     (100)    (1.2471)    (1.3538)     (.5841)   (1.6827)
A      101      1.2507      1.3529       .5801     1.6920
A     (101)    (1.2533)    (1.3539)     (.5811)   (1.6921)
B      102      1.2453      1.3589       .5825     1.6887
B     (102)    (1.2473)    (1.3626)     (.5802)   (1.6972)
D      500      1.2527      1.3608       .5787     1.7018
D     (500)    (1.2521)    (1.3609)     (.5787)   (1.7015)
        ∞       1.2533      1.3626       .5774     1.7062

The numbers up to n = 12 are exact, those in brackets are the limiting values, obtained by the
methods of Section (4), and the remainder are obtained from the Equations of Section (5).


6. The Standard Error of the Distance between the Median and Quartile Points. 

The mean value of this distance in repeated samples is clearly the mean value
of the distance of the quartile point from the population mean, or $\bar{q}$. A table
of $\bar{q}$ was given in my previous paper (loc. cit. p. 341), and this was extended by
K. Pearson (loc. cit. p. 372).

The standard error of $(q - m)$ may be calculated from the following relation

$$\sigma_{q-m}^2 = \sigma_q^2 + \sigma_m^2 - 2\, r_{mq}\, \sigma_q \sigma_m \qquad (29),$$

which has been used to give the values of the ratio of $\sigma_{q-m}$ to the approximate
standard error of a standard deviation, $\sigma/\sqrt{2n}$, shown in the last column of
Table III.
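
As a check on relation (29), the last column of Table III can be recomputed from the other three. The sketch below assumes that $\sigma_m$ and $\sigma_q$ are entered in units of $\sigma/\sqrt{n}$ and that the result is wanted in units of $\sigma/\sqrt{2n}$, which introduces the factor 2 under the root; the function name is illustrative.

```python
import math


def sigma_q_minus_m_ratio(sigma_m_ratio, sigma_q_ratio, r_mq):
    """Standard error of (q - m) in units of sigma / sqrt(2n).

    sigma_m_ratio and sigma_q_ratio are in units of sigma / sqrt(n).
    """
    variance = (sigma_q_ratio ** 2 + sigma_m_ratio ** 2
                - 2.0 * r_mq * sigma_q_ratio * sigma_m_ratio)
    # Changing the unit from sigma/sqrt(n) to sigma/sqrt(2n) multiplies by sqrt(2).
    return math.sqrt(2.0 * variance)


# Limiting row and the exact row for n = 2 of Table III.
print(round(sigma_q_minus_m_ratio(1.2533, 1.3626, 0.5774), 4))
print(round(sigma_q_minus_m_ratio(1.0000, 1.1676, 0.8564), 4))
```

The results agree with the tabled 1.7062 and 0.8525 to within the rounding of the four-figure inputs.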

On p. 358 of my other paper two estimates of the population standard deviation,
$\sigma$, were compared; $E_1$ based on the sample standard deviation, and $E_2$
based on the interquartile distance. Both estimates were so adjusted that their
mean values in repeated samples would be $\sigma$. It is possible to obtain a third
estimate from the distance, $q - m$, between the median point and a quartile point,
namely

$$E_3 = (q - m)/\bar{q} \qquad (30).$$

We shall have

$$\text{Mean } E_3 = \sigma \qquad (31),$$




$$\sigma_{E_3} = \frac{\sigma_{q-m}}{\bar{q}} \qquad (32).$$

Values of the multiplier for $E_3$ and of the corresponding multipliers for $E_1$ and $E_2$ are shown in
Table IV.


TABLE IV.

Comparison of Standard Errors of Estimates.

 n      $E_1$     $E_2$     $E_3$

 2      1.511     1.511     1.511
 3      1.280     1.286     1.966
 4      1.194     1.250     1.561
 5      1.148     1.185     1.838
 6      1.120     1.691     2.197
 7      1.100     1.469     2.241
 8      1.086     1.429     2.027
 9      1.076     1.360     2.103
10      1.068     1.598     2.328
11      1.061     1.526     2.331
12      1.056     1.498     2.187
 ∞      1.000     1.649     2.530


It is clear that $E_3$ would be an estimate of $\sigma$ of very little value, and one which
is distinctly worse than $E_2$. In addition to its large standard error, it is far from
normally distributed (see Table V below).






7 . Comparison of interpolated Values with those found by K. Pearson's Method. 

Certain of the values of $r_{mq}$, $\sigma_m$ and $\sigma_q$ obtained from the empirical equations of
Section (5) were recomputed by K. Pearson's method*, in order to obtain some
measure of their accuracy. The appropriate formula for the mean value of the
product $\{x_p x_q\}$ of two ranked individuals is his (xxi)*. Taking the case n = 4p + 3,
for which both median and quartile points correspond to observations, we obtain
using his notation, in which m refers to the median,


Mean {.* TO * fl ]/o 2 




n — m+1 _ , (n— m+ l)(n - m + 2) _ 

h + 1 ®'»+ 1 + ‘' a (n +• l)(n+ 2) flT “ + * 


~b 3 


(n— m + l)(n —m + 2)(n — m + 3) 
(n + l)(n + 2H^+ 3) 


<r^n+3+- • 

....(33), 


where he has given numerical values for the $b$'s up to $b_{13}$ (loc. cit. Equations (xi)
and (xii)).


Case (i). Samples of 15; n = 15, q = 4, m = 8.

After calculations similar to those carried out by K. Pearson (loc. cit. pp. 388–389)
I find that

$$\text{Mean}\,\{x_m x_q\}/\sigma^2 = .063{,}271{,}899.$$

Further, $\sigma_q$ and $\sigma_m$ may be obtained from his formula (xvii) and Table VI
respectively, as follows,

$$\sigma_q/\sigma = .347{,}0143, \qquad \sigma_m/\sigma = .318{,}692.$$

The former leads to a value of 1.34398 for the ratio of $\sigma_q$ to $\sigma/\sqrt{n}$, which may
be compared with the value of 1.3540 obtained from my formula (27) above.

Since $\bar{x}_m = 0$, it follows that

$$r_{mq} = \text{Mean}\,\{x_m x_q\}/(\sigma_m \sigma_q) = .57213,$$

which may be compared with the value of .5725 obtained from formula (28), and
given in my Table III.

Case (ii). Samples of 35; n = 35, q = 9, m = 18.

Here I find

$$\text{Mean}\,\{x_m x_q\}/\sigma^2 = .027{,}753{,}212,$$

and from K. Pearson's formulae (xvii) and (xviii),

$$\sigma_q/\sigma = .229{,}4863, \qquad \sigma_m/\sigma = .210{,}5396.$$

Hence $\sigma_q/(\sigma/\sqrt{n}) = 1.3577$ against my 1.3593, and $\sigma_m/(\sigma/\sqrt{n}) = 1.2456$ against my 1.2457.

It follows also that $r_{mq} = .57441$,

whereas my formula (28) has given a value .5760.
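
The arithmetic of Cases (i) and (ii) can be retraced as follows. The sketch takes the quoted product moment and the two standard deviations (all in units of $\sigma$) as given, and forms the ratios to $\sigma/\sqrt{n}$ and the correlation; the function and key names are illustrative.

```python
import math


def recompute_case(n, mean_product, sd_q, sd_m):
    """Ratios and correlation for one sample size.

    mean_product is Mean{x_m x_q}/sigma^2; sd_q and sd_m are the standard
    deviations of the quartile and median divided by sigma.
    """
    root_n = math.sqrt(n)
    return {
        "sigma_q_ratio": sd_q * root_n,          # sigma_q / (sigma / sqrt(n))
        "sigma_m_ratio": sd_m * root_n,          # sigma_m / (sigma / sqrt(n))
        "r_mq": mean_product / (sd_q * sd_m),    # the mean of x_m is zero
    }


case_15 = recompute_case(15, 0.063271899, 0.3470143, 0.318692)
case_35 = recompute_case(35, 0.027753212, 0.2294863, 0.2105396)

for label, case in (("n = 15", case_15), ("n = 35", case_35)):
    print(label, {key: round(value, 5) for key, value in case.items()})
```

The correlation is the product moment divided by the product of the two standard deviations only because the median has zero mean, as noted in Case (i).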

* Biometrika, Vol. XXIII. pp. 384–390.




There is little doubt that K. Pearson's method is likely to be more accurate
than that which I have employed for interpolating between n = 12 and the limiting
values, but it involves calculations of some length even in the simplest case when
n = 4p + 3. For the cases n = 4p + 2, n = 4p + 1 and n = 4p the work involved
would be very much longer, and no comparison has been attempted.

8. Some experimental Sampling Results . 

The following further results have been shown for comparison. 


TABLE V. 


Size of Sample, « 

(No. of Samples, N) 

4 

(1000) 

7 

(1000) 

10 

(1000) 

15 

(1000) 

20 

(1000) 


40 

(500) 

( Expori merit 
r,n V ^Theory ... 

'8011 

•7900 

*5627 

•5625 

•5933 

•6322 

•5290 

•5725 

•6250 

•6250 

*6030 

•5951 

*5539 

•6019 

/ <r ( Experiment 
- 

1*0451 

1-0355 

1-7406 

1-6970 

1-5929 

1-5275 

— 

l *5200 
1-5480 

1-6209 

1*6467 

1-6876 

1-6244 

Pi for (< i - w), Experiment 
$ 2 )> 

•5463 

3*4345 

•8088 

3*9728 

•7907 

3-7666 

— 


•3526 

3-9191 

•1637 

2-9405 


The values of $\sigma_{q-m}$ were calculated directly from the data (except in the case
n = 15), and it is found that none of them differs from the theoretical values by
more than twice the appropriate standard error; $r_{mq}$ was then calculated from
equation (29), using the experimental values for $\sigma_q$ and $\sigma_m$ tabled in my earlier
paper. In the case of n = 15, $r_{mq}$ was calculated directly from the data.

It will be seen from the values of $\beta_1$ and $\beta_2$ that the sampling distribution of
$q - m$ is far from normal, even where the samples are large, and as has been pointed
out the estimate of $\sigma$ obtainable from this inter-rank distance is not a satisfactory
one.











A FURTHER STUDY OF METHODS OF CONSTRUCTING 
LIFE TABLES WHEN CERTAIN CAUSES OF DEATH 

ARE ELIMINATED. 

By M. NOEL KARN, M.A.

In a recent paper entitled “An Inquiry into Various Death-rates and the
Comparative Influence of certain Diseases on the Duration of Life”* D'Alembert's
method was applied to the construction of life tables for a population from which 
cancer and tuberculosis were supposed to be eliminated as causes of death in order 
to estimate the effect of these diseases in shortening the duration of life. In that 
paper no reference was made to the work on this problem by Dr Farr towards the 
end of last century, nor to Louis I. Dublin's work in recent years. In the present 
paper a comparison has been made of the results of the several methods when 
applied to the same data in order to determine what, if any, are the practical 
advantages of employing D'Alembert's formula over others which have been used. 
As to the theoretical advantages of D'Alembert's formula, I think there can be no 
question. 

Comparison between Results of Farr's Method and D'Alembert's Formula. 

Dr Farr’s work was published in the Supplement to the Thirty-Fifth Annual
Report of the Registrar-General, 1875, and entitled “Effect of the Extinction of
any single Disease on the Duration of Life†.” In this he made passing reference to
the previous work on the same kind of problem in connection with the controversy
over inoculation, mentioning Daniel Bernoulli and D’Alembert, and giving Duvillard's
value for the increase in the mean life time which would result from the extinction
of small-pox. He then referred to the short method which he had described in the
Appendix to the R.-G.'s Fifth Annual Report‡, using it for this particular purpose
as being sufficiently exact, and thus saving the labour of constructing and graduating
full life tables.

In his short method Farr made use of quinquennial age-groups. The number 
of deaths in age-group x to x + 5 years divided by the population of that age-group 
gave the probability p of living one year in the middle of the period. The fifth 
power was therefore taken in order to obtain the number of survivors at the end of 
the period. Thus the chance of living for five years at age x was $p^5$.
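
A minimal sketch of the short method may make the procedure concrete. It assumes that the one-year chance of living is derived from the central death-rate $m$ (deaths divided by population) as $p = (2 - m)/(2 + m)$, which is the relation equivalent to $m = q/(1 - \tfrac{1}{2}q)$ used later in this paper; the function names and the death-rates in the example are hypothetical and are not Farr's data.

```python
def one_year_survival(central_rate):
    """Chance of living one year, p = (2 - m) / (2 + m), from a central death-rate m."""
    return (2.0 - central_rate) / (2.0 + central_rate)


def farr_survivors(survivors_at_start, central_rates):
    """Survivors at the end of each quinquennial age-group.

    central_rates holds one central death-rate per five-year group, in order
    of age; each group carries the survivors forward by the factor p**5.
    """
    survivors = [survivors_at_start]
    for rate in central_rates:
        survivors.append(survivors[-1] * one_year_survival(rate) ** 5)
    return survivors


# Hypothetical central death-rates for three successive five-year groups.
example = farr_survivors(100000.0, [0.004, 0.006, 0.010])
print([round(value) for value in example])
```

The starting number of survivors (here at the first age to which the method applies) has to be supplied from elsewhere.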

* Annals of Eugenics, Vol. IV. pp. 279–326, 1931.

† P. xxxviii. § 21.

‡ P. 362, “A short method of constructing Life Tables.”



92 


Life Tables with eliminated Diseases 

The method was only applicable for ages after 5 years. The survivors at age 5 
would have to be obtained from the known deaths and populations year by year for 
the ages before 5. 

Farr attempted the problem of eliminating a particular disease from the life 
table population, taking, among other diseases, as examples, cancer and phthisis. To 
do this in the case of cancer he constructed a new life table on the basis of the 
mortality from all diseases except cancer, viz. $m_x - m_x^{(i)}$, using the short method above
described. This he assumed gave a life table population as it would be if there were 
no cancer mortality. 

I have first tried to ascertain how closely the numbers of survivors and expectations 
of life at different ages obtained by this method approximate to the more exact 
results obtained by D’Alembert’s full formula. 

The amount of error due to the approximations used in the short method of
constructing an ordinary life table was investigated by Farr and is given in
Table 58 of the Supplement* for the case of the English Life Table for Males.
The result is an excess in the short method in the expectation of life, to the amount
of .37 to .53 of a year in the mid-age groups up to 65, followed by a rapid increase
in the excess at the older ages.

Table 60 of the same Supplement† gives survivors of a life table based on the
deaths for the years 1861–70, and also the survivors for a life table with cancer
excluded. I have worked out, and show in Table I, the expectation of life for both
these tables for comparison with the results tabulated in Table VI of my previous
paper‡.

TABLE I.

Life Tables for Males (calculated from the Facts recorded during 1861–70).

           To die of all Diseases        Cancer excluded
Age x      $l_x$      $\mathring{e}_x$       $l_x$      $\mathring{e}_x$       Increase in $\mathring{e}_x$

  0       510 622     40.55          510 622     40.75          .20
  5       367 817     50.32          367 835     50.65          .33
 10       353 129     47.31          353 165     47.65          .34
 15       345 341     43.32          345 393     43.61          .29
 20       334 867     39.60          334 951     39.90          .30
 25       321 013     36.20          321 126     36.51          .31
 35       290 755     29.45          291 032     29.76          .31
 45       254 138     22.97          254 890     23.27          .30
 55       209 825     16.77          211 563     17.02          .25
 65       150 844     11.37          153 945     11.52          .15
 75        77 409      7.41           80 501      7.46          .05
 85        17 826      5.45           18 970      5.44          .01
 95           789      5.11              823      5.11          .00
105             9       –                  9       –             –


* See Table 59, p. clxix. Supplement to Thirty-Fifth Annual Report of R.-G., 1861–70.
† Ibid. ‡ Annals of Eugenics, Vol. IV. p. 309.






The order of the differences in the last column of this table is never greater 
than about a third of a year for the period 1861 — 70. Whatever method is used, 
the results show that these differences have increased to nearly one and a half years 
in mid-life for the period 1919 — 23. 

I turned next to the modern data and have found the effect of the elimination 
of cancer mortality from Life Table 9, applying Farr’s short method instead of the 
formula of D’Alembert. 

In Table II are the results obtained by Farr’s short method from the data to 
which the full formula of D’Alembert was applied in my original paper. 

TABLE II.

Comparison of the Increases in Expectation of Life due to the Elimination of Cancer
Mortality, calculated by two Methods.

Columns: (1) Age; (2) $\mathring{e}_x$ for Life Table 9; (3) $\mathring{e}_x$ for Life Table with Cancer eliminated by use of D’Alembert’s formula; (4) Increase in $\mathring{e}_x$ by D’Alembert’s formula; (5) $\mathring{e}_x$ for Life Table with Cancer eliminated by use of Farr’s short method; (6) Increase in $\mathring{e}_x$ by Farr’s short method; (7) Excess by Farr’s method; (8) Percentage excess on increase in expectation of life due to using Farr’s method.

(1)     (2)      (3)      (4)      (5)      (6)     (7)    (8)

  0    55.62    56.89    1.27    56.97    1.35     .08      6
  5    58.81    60.25    1.44    60.35    1.54     .10      7
 10    54.64    56.10    1.46    56.19    1.55     .09      6
 15    50.12    51.59    1.47    51.68    1.56     .09      6
 20    45.78    47.26    1.48    47.37    1.59     .11      7
 25    41.60    43.09    1.49    43.21    1.61     .12      8
 30    37.40    38.91    1.51    39.03    1.63     .12      8
 35    33.25    34.77    1.52    34.88    1.63     .11      7
 40    29.19    30.71    1.52    30.84    1.65     .13      9
 45    25.22    26.72    1.50    26.85    1.63     .13      9
 50    21.36    22.79    1.43    22.93    1.57     .14     10
 55    17.73    19.02    1.29    19.17    1.44     .15     12
 60    14.36    15.45    1.09    15.63    1.27     .18     17
 65    11.36    12.20     .84    12.38    1.02     .18     21
 70     8.75     9.34     .59     9.53     .78     .19     32
 75     6.59     6.96     .37     7.20     .61     .24     65
 80     4.93     5.14     .21     5.40     .47     .26    124


The difference due to using the short method ranges from .08 to .26 of a year
in excess, and would increase the additional expectation of life by the elimination
of cancer from a maximum value of 1.52 years at 40 to 1.65 years, that is, by
9 per cent.
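
The excess and percentage columns of Table II follow from the three expectations of life by simple differences, as the sketch below illustrates for ages 0, 40 and 80. The figures are those of Table II; the function name and the layout of `rows` are illustrative.

```python
def farr_excess(e_standard, e_dalembert, e_farr):
    """Increase by each method, excess of Farr's method, and percentage excess."""
    increase_dalembert = e_dalembert - e_standard
    increase_farr = e_farr - e_standard
    excess = increase_farr - increase_dalembert
    percentage = 100.0 * excess / increase_dalembert
    return increase_dalembert, increase_farr, excess, percentage


# age: (Life Table 9, cancer eliminated by D'Alembert, cancer eliminated by Farr)
rows = {
    0: (55.62, 56.89, 56.97),
    40: (29.19, 30.71, 30.84),
    80: (4.93, 5.14, 5.40),
}

for age, values in rows.items():
    inc_d, inc_f, excess, pct = farr_excess(*values)
    print(age, round(inc_d, 2), round(inc_f, 2), round(excess, 2), round(pct))
```

The percentage is taken on the increase given by D’Alembert’s formula, which is why a nearly constant absolute excess becomes a large percentage at the older ages, where that increase is small.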

An estimate of the amount of error permissible by using a short method is given
by Dr Snow* as .08 of a year, and if this is to be taken as a criterion the evaluation
of the increase in the expectation of life in this special problem would not be

* “An Elementary Rapid Method of Constructing an Abridged Life Table,” E. C. Snow. Supplement
to the Seventy-Fifth Annual Report of the R.-G., Part II.





TABLE III.

Comparison of the Increases in Expectation of Life due to the Elimination of
Pulmonary Tuberculosis Mortality, calculated by two Methods.

Columns: (1) Age; (2) $\mathring{e}_x$ for Life Table 9; (3) $\mathring{e}_x$ for Life Table with Pulmonary Tuberculosis eliminated by use of D’Alembert’s formula; (4) Increase in $\mathring{e}_x$ by D’Alembert’s formula; (5) $\mathring{e}_x$ for Life Table with Pulmonary Tuberculosis eliminated by use of Farr’s short method; (6) Increase in $\mathring{e}_x$ by Farr’s short method; (7) Excess by Farr’s method; (8) Percentage excess on increase in expectation of life due to using Farr’s method.

(1)     (2)      (3)      (4)      (5)      (6)     (7)    (8)

  0    55.62    57.40    1.78    57.48    1.86     .08      5
  5    58.81    60.79    1.98    60.89    2.08     .10      5
 10    54.64    56.63    1.99    56.72    2.08     .09      5
 15    50.12    52.07    1.95    52.17    2.05     .10      5
 20    45.78    47.58    1.80    47.70    1.92     .12      7
 25    41.60    43.13    1.53    43.24    1.64     .11      7
 30    37.40    38.67    1.27    38.79    1.39     .12      9
 35    33.25    34.28    1.03    34.41    1.16     .13     13
 40    29.19    29.98     .79    30.12     .93     .14     18
 45    25.22    25.80     .58    25.94     .72     .14     24
 50    21.36    21.76     .40    21.91     .55     .15     38
 55    17.73    17.99     .26    18.15     .42     .16     62
 60    14.36    14.51     .15    14.71     .35     .20    133
 65    11.36    11.44     .08    11.64     .28     .20      –
 70     8.75     8.78     .03     9.01     .26     .23      –
 75     6.59     6.60     .01     6.91     .32     .31      –
 80     4.93     4.93     .00     5.31     .38     .38      –



TABLE IV.

Comparison of the Increases in Expectation of Life due to the Elimination of
Heart Diseases Mortality, calculated by two Methods.

Columns: (1) Age; (2) $\mathring{e}_x$ for Life Table 9; (3) $\mathring{e}_x$ for Life Table with Heart Diseases eliminated by use of D’Alembert’s formula; (4) Increase in $\mathring{e}_x$ by D’Alembert’s formula; (5) $\mathring{e}_x$ for Life Table with Heart Diseases eliminated by use of Farr’s short method; (6) Increase in $\mathring{e}_x$ by Farr’s short method; (7) Excess by Farr’s method; (8) Percentage excess on increase in expectation of life due to using Farr’s method.

(1)     (2)      (3)      (4)      (5)      (6)     (7)    (8)

  0    55.62    57.32    1.70    57.41    1.79     .09      5
  5    58.81    60.74    1.93    60.85    2.04     .11      6
 10    54.64    56.58    1.94    56.67    2.03     .09      5
 15    50.12    52.03    1.91    52.12    2.00     .09      5
 20    45.78    47.67    1.89    47.78    2.00     .11      6
 25    41.60    43.45    1.85    43.57    1.97     .12      7
 30    37.40    39.20    1.80    39.32    1.92     .12      7
 35    33.25    35.01    1.76    35.11    1.86     .10      6
 40    29.19    30.90    1.71    31.02    1.83     .12      7
 45    25.22    26.89    1.67    27.00    1.78     .11      7
 50    21.36    22.97    1.61    23.10    1.74     .13      8
 55    17.73    19.26    1.53    19.39    1.66     .13      9
 60    14.36    15.77    1.41    15.94    1.58     .17     12
 65    11.36    12.61    1.25    12.78    1.42     .17     14
 70     8.75     9.77    1.02     9.96    1.21     .19     19
 75     6.59     7.39     .80     7.62    1.03     .23     29
 80     4.93     5.50     .57     5.76     .83     .26     46






sufficiently accurately worked by the short method of Farr, especially when a comparison
is to be made of the alteration in the expectation of life due to the elimination
of cancer mortality for two or three past decades.

Farr's short method applied for the elimination of Pulmonary Tuberculosis and
Heart Diseases separately from the ordinary life table shows differences in expectation
of life ranging again from .08 of a year at birth to .38 in the case of Pulmonary
Tuberculosis, and from .09 to .26 in the case of Heart Diseases when compared with
the fuller method, as shown in Tables III and IV.

These differences become large in the later ages when regarded as percentage
excess on the increase in expectation of life, especially in the Pulmonary Tuberculosis
investigation. This shows that the method of Farr is not accurate enough for this
purpose.

Re-calculation in five-yearly Groups of the Data on which Life Table 9 is based. 

In estimating the additional expectation of life resulting from eliminating a cause 
of death by the shorter method, the calculation should perhaps be made not from 
the standard life table, but from one based on the same data but calculated also by 
the same method. The comparative values of the normal expectation of life calculated 
on this basis are set out in Table V. 

TABLE V.

Expectation of Life.

Age     For Life Table 9     For Life Table calculated     Excess by
                             in five-yearly periods        Farr’s method

  0          55.62                   55.71                     .09
  5          58.81                   58.91                     .10
 10          54.64                   54.74                     .10
 15          50.12                   50.21                     .09
 20          45.78                   45.89                     .11
 25          41.60                   41.72                     .12
 30          37.40                   37.52                     .12
 35          33.25                   33.37                     .12
 40          29.19                   29.32                     .13
 45          25.22                   25.36                     .14
 50          21.36                   21.51                     .15
 55          17.73                   17.89                     .16
 60          14.36                   14.56                     .20
 65          11.36                   11.57                     .21
 70           8.75                    8.98                     .23
 75           6.59                    6.90                     .31
 80           4.93                    5.31                     .38
 85           3.72                    4.14                     .42


In Table VI the increases in the expectation of life due to the elimination of
the several diseases considered are shown, the results by Farr's method being compared
with the standard life table re-calculated by the same method. The percentage
difference on the increase is now diminished, as compared with the results in Tables
II, III and IV, to an amount less than one, throughout the table in the case of
Pulmonary Tuberculosis, and to age 60 in the table with Cancer eliminated.

TABLE VI.

Increase in Expectation of Life.

For each disease eliminated three columns are given: (a) by Farr’s method compared with re-calculated Life Table; (b) as in Table II, III or IV respectively; (c) by D’Alembert’s formula compared with Life Table 9.

         With Cancer               With Pulmonary            With Heart Diseases
         eliminated                Tuberculosis eliminated   eliminated
Age     (a)     (b)     (c)       (a)     (b)     (c)       (a)     (b)     (c)

  0     1.26    1.35    1.27      1.77    1.86    1.78      1.70    1.79    1.70
  5     1.44    1.54    1.44      1.98    2.08    1.98      1.94    2.04    1.93
 10     1.45    1.55    1.46      1.98    2.08    1.99      1.93    2.03    1.94
 15     1.47    1.56    1.47      1.96    2.05    1.95      1.91    2.00    1.91
 20     1.48    1.59    1.48      1.81    1.92    1.80      1.89    2.00    1.89
 25     1.49    1.61    1.49      1.52    1.64    1.53      1.85    1.97    1.85
 30     1.51    1.63    1.51      1.27    1.39    1.27      1.80    1.92    1.80
 35     1.51    1.63    1.52      1.04    1.16    1.03      1.74    1.86    1.76
 40     1.52    1.65    1.52       .80     .93     .79      1.70    1.83    1.71
 45     1.49    1.63    1.50       .58     .72     .58      1.64    1.78    1.67
 50     1.42    1.57    1.43       .40     .55     .40      1.59    1.74    1.61
 55     1.28    1.44    1.29       .26     .42     .26      1.50    1.66    1.53
 60     1.07    1.27    1.09       .15     .35     .15      1.38    1.58    1.41
 65      .81    1.02     .84       .07     .28     .08      1.21    1.42    1.25
 70      .55     .78     .59       .03     .26     .03       .98    1.21    1.02
 75      .30     .61     .37       .01     .32     .01       .72    1.03     .80
 80      .09     .47     .21       .00     .38     .00       .45     .83     .57


Table VII shows the percentage difference for some of the later ages in the 
tables with Cancer or Heart Diseases eliminated, Farr’s method now giving a defect. 


TABLE VII.

         With Cancer eliminated              With Heart Diseases eliminated
Age     Defect in increase    Percentage    Defect in increase    Percentage
        in expectation of     defect        in expectation of     defect
        life due to Farr's                  life due to Farr’s
        method                              method

 45          .01                 1               .03                 2
 50          .01                 1               .02                 1
 55          .01                 1               .03                 2
 60          .02                 2               .03                 2
 65          .03                 4               .04                 3
 70          .04                 7               .04                 4
 75          .07                19               .08                10
 80          .12                57               .12                21





In deciding whether the short method is sufficiently accurate when a particular 
disease is to be eliminated the form of the curve of mortality rates of the disease 
must be considered. In Fig. 1 on the previous page are given: 

The curves of annual mortality rates per 100,000 : 

(1) for deaths from all causes, 

(2) for deaths from heart diseases, 

(3) for deaths from all causes except cancer in five-yearly groups are given

for the period of the data under consideration.

Similar curves have already been given for cancer and for pulmonary tuber- 
culosis in the former paper. 

The curve for deaths from all causes rises rapidly towards the end of life, the 
curve for heart diseases rises in the same way but at a more gradual slope, and the 
curve for all deaths except cancer follows a course similar to that for all deaths. 

The curve of mortality for pulmonary tuberculosis is different in form. The 
slope is generally gradual whether it is ascending or descending. 

Provided then that one starts from a life table constructed in the same way, 
the short method of Farr is seen to lead to tolerably accurate differences in life 
expectation due to elimination of the diseases which have been under consideration 
in this memoir, except in the cases of cancer and heart diseases for ages after 60 
when the inaccuracy increases. The method however involves the additional work 
of first re-computing the standard life table on the basis of five-yearly instead of 
yearly age-groups. 

The error introduced in the short method arises from the fact that the numbers
saved from the disease in a particular five-yearly period are regarded as exempt from
risk of death from other diseases for the whole of the five years. This is not the
case, for those saved from the particular disease will be subjected to the death-rate
from other diseases from the moment in which they would have died of the special
disease, which connotes on the average for half the period. This error does not
occur when infinitesimal periods are used. In the use of D’Alembert’s formula there
is the additional advantage that a series of annual cancer (or other) mortality rates
is obtained which is of interest and value in itself, and which proved to be of use
in other problems dealt with in my former paper.

Some Work on the same Problem from Data of the Metropolitan Life 
Insurance Company of New York . 

Some work on the same problem has been published in the Statistical 
Bulletin * of the Metropolitan Life Insurance Company of New York, under 
the title “ Effect of Cancer upon the Length of Life ” and “ Loss in Expectation of Life 
on account of organic Heart Diseases,” for data of the Industrial Population 
1911 — 1916. The tables for males, white, are reproduced as Table VIII. 

* Stat. Bull. M.L.I. Co., Oct. 1920, Vol. I. No. 10, p. 5; Ibid. Feb. 1921, Vol. II. No. 2, p. 6.





TABLE VIII.

Age     Average Number of Years of           Average Number of Years of
        After Lifetime Lost on Account       After Lifetime Lost on Account
        of Cancer. (All Forms.)              of Organic Diseases of the Heart.
        White, Males                         White, Males

  0              0.62                                 1.67
  1               .70                                 1.86
  2               .72                                 1.91
  3               .73                                 1.94
  4               .73                                 1.95
  5               .72                                 1.95
 15               .73                                 1.93
 25               .75                                 1.89
 35               .79                                 1.88
 45               .80                                 1.87
 55               .73                                 1.81
 65               .53                                 1.56
 75               .39                                 1.19
 85               .30                                  .96
 95               .17                                  .41


The column referring to cancer gives smaller figures than those for English 
data of about the same period, viz. 1909 — 1913, as given in Table VI of my former 
paper, but the results resemble those of English data of the last three decades in 
rising to a maximum loss of years between 40 and 50 years. 

The results for heart diseases are very similar to those obtained from the English 
data 1919 — 1923, given in Table V a of my original paper. 


The formula used to obtain the results given in Table VIII has been communicated
to me by Messrs Dublin and Lotka as

$$q_x^{(-i)} = \frac{q_x - q_x^{(i)}}{1 - q_x^{(i)}} \qquad \text{A},$$

where $q_x$ denotes the usual life table function when all causes of death are effective,
$q_x^{(i)}$ denotes the corresponding life table function when only cause i of death is
effective, and $q_x^{(-i)}$ denotes the corresponding life table function when all causes
except i of death are effective.


The formula A may be obtained as follows: 

The probability of living for one year at age x at risk of death from all causes
is equal to the product of the probability of living for one year when only cause
i of death is effective and the probability of living for one year when all causes
except i are effective, that is

$$p_x = p_x^{(i)}\, p_x^{(-i)},$$

or

$$1 - q_x = (1 - q_x^{(i)})(1 - q_x^{(-i)});$$

$$\therefore \quad q_x^{(-i)} = \frac{q_x - q_x^{(i)}}{1 - q_x^{(i)}},$$

$p_x$ denoting the usual life table function when all causes of death are effective,
$p_x^{(i)}$ denoting the corresponding function when only cause i of death is effective, and
$p_x^{(-i)}$ denoting the corresponding function when all causes of death except i are
effective.
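
Formula A is easily applied in computation. The sketch below evaluates it for a single age and confirms that the two partial chances of living multiply to the chance of living under all causes; the function name and the numerical rates are hypothetical and are not taken from the data of this paper.

```python
def q_excluding_cause(q_all, q_cause):
    """Formula A: chance of dying within the year when cause i is eliminated.

    q_all is the chance of dying from all causes, q_cause the chance of dying
    when only cause i is effective.
    """
    if not 0.0 <= q_cause <= q_all < 1.0:
        raise ValueError("expected 0 <= q_cause <= q_all < 1")
    return (q_all - q_cause) / (1.0 - q_cause)


# Hypothetical rates at one age.
q_all, q_cause = 0.020, 0.004
q_without = q_excluding_cause(q_all, q_cause)

# The chances of living must multiply: p_x = p_x(i) * p_x(-i).
product = (1.0 - q_cause) * (1.0 - q_without)
print(round(q_without, 6), round(product, 6))
```

When the eliminated cause accounts for none of the deaths the formula returns $q_x$ unchanged, and when it accounts for all of them it returns zero.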

I have applied the formula A to the data in hand in yearly periods and find
that it gives a very near approximation to the formula of D’Alembert, the survivors 
in the life table excluding a special disease being the same to within a few units 
in 100,000 starting life together, at all ages through life, whether the disease con- 
sidered is Cancer, Pulmonary Tuberculosis, or Heart Diseases. The expectations of 
life are exactly the same for the two methods. 


Comparison of Formula A with Farr’s Method.

The formula which Farr used would be

$$m_x - m_x^{(i)} = m_x^{(-i)} \qquad \text{B},$$

where $m_x = \dfrac{q_x}{1 - \tfrac{1}{2} q_x}$, and $m_x^{(i)}$, $m_x^{(-i)}$ have similar meanings.

Substituting these values for the m’s Formula B gives on reduction

$$q_x^{(-i)} = \frac{q_x - q_x^{(i)}}{1 - q_x^{(i)} + \tfrac{1}{4}\, q_x\, q_x^{(i)}},$$

which differs from formula A in the denominator of the right-hand side.
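
The reduction of Formula B, and the size of its departure from Formula A, may be verified numerically. The sketch below assumes only the relation $m = q/(1 - \tfrac{1}{2}q)$ stated above and its inverse $q = m/(1 + \tfrac{1}{2}m)$; the function names and the rates in the example are hypothetical.

```python
def central_rate(q):
    """m = q / (1 - q/2)."""
    return q / (1.0 - 0.5 * q)


def chance_of_dying(m):
    """Inverse of central_rate: q = m / (1 + m/2)."""
    return m / (1.0 + 0.5 * m)


def q_excluding_formula_a(q_all, q_cause):
    """Formula A."""
    return (q_all - q_cause) / (1.0 - q_cause)


def q_excluding_formula_b(q_all, q_cause):
    """Formula B applied directly: subtract the central rates, then convert back."""
    return chance_of_dying(central_rate(q_all) - central_rate(q_cause))


def q_excluding_formula_b_reduced(q_all, q_cause):
    """Formula B after reduction, with the extra term in the denominator."""
    return (q_all - q_cause) / (1.0 - q_cause + 0.25 * q_all * q_cause)


# Hypothetical five-yearly chances of dying, where the difference is most visible.
q_all, q_cause = 0.30, 0.05
print(round(q_excluding_formula_a(q_all, q_cause), 6))
print(round(q_excluding_formula_b(q_all, q_cause), 6))
print(round(q_excluding_formula_b_reduced(q_all, q_cause), 6))
```

The additional positive term in the denominator makes the value from Formula B smaller than that from Formula A, which is consistent with Farr’s method giving an excess in the expectation of life.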


The amount of the difference in the expectation of life given by Formula B as 
compared with that given by D’Alembert’s formula, or by Formula A, has already 
been shown.

It may be of interest to apply Formula A to the cancer data in five-yearly 
periods in order to compare the expectation of life with the results found in yearly 
periods. The results are given in Table IX. 


This table shows that there is no sensible error involved in computing the 
additional expectation of life resulting from elimination of cancer as a cause of 
death by using five-yearly periods, so long as the life table with which comparison 
is made is computed in similar periods. 

In conclusion, the difference in the methods used lies in the evaluation of $q_x^{(-i)}$,
D’Alembert’s formula giving instantaneous values, Dublin’s formula values at
yearly intervals, and Farr’s values at quinquennial periods.

For rapidity of calculation, combined with accuracy, the formula giving yearly 
values has some advantage over that giving instantaneous values. 

In either case the results are arbitrary to some extent, as the original figures 
are usually only obtainable in quinquennial age-groups. 





TABLE IX.

Expectation of Life and Increase in that Expectation over that of a Standard
Life Table in a Population excluding Cancer.

        Calculated in yearly           Calculated in 5-yearly
        periods by formula A           periods by formula A
        --------------------------     -----------------------------------------------
 Age    e_x           Increase over    e_x           Standard Life    Increase over
                      Life Table 9                   Table (see       Standard Life
                                                     Table V), e_x    Table

  0     56.89         1.27             56.99         55.71            1.28
  5     60.25         1.44             60.37         58.91            1.46
 10     56.10         1.46             [illegible]   54.74            1.47
 15     [illegible]   1.47             51.69         50.21            1.48
 20     47.26         1.48             47.39         45.89            1.50
 25     [illegible]   1.49             [illegible]   41.72            1.51
 30     38.91         1.51             39.04         37.52            1.52
 35     34.77         1.52             34.90         33.37            1.53
 40     30.71         1.52             30.86         29.32            1.54
 45     26.72         1.50             26.87         25.36            1.51
 50     22.79         1.43             22.96         21.51            1.45
 55     19.02         1.29             19.20         17.89            1.31
 60     15.45         1.09             15.66         14.56            1.10
 65     12.20          .84             12.42         11.57             .85
 70      9.34          .59              9.57          8.98             .59
 75      6.96          .37              7.26          6.90             .36
 80      5.14          .21              5.51          5.31             .20

BIBLIOGRAPHY. 

(1) Karn, M. N. An Inquiry into Various Death-Rates and the Comparative Influence of
Certain Diseases on the Duration of Life. Annals of Eugenics, Vol. iv. Parts iii and iv,
May, 1931.

(2) Registrar-General’s Fifth Annual Report. Appendix.

(3) Statistical Bulletin, Metropolitan Life Insurance Company of New York. Oct. 1920, Vol. i.
No. 10. Feb. 1921, Vol. ii. No. 2.

(4) Supplement to the Registrar-General’s 85th Report.

(5) Supplement to the Registrar-General’s 75th Report, Part ii.




A TEST OF THE SIGNIFICANCE OF THE DIFFERENCE 
OF THE CORRELATION COEFFICIENTS IN NORMAL 
BIVARIATE SAMPLES. 

By FRED A. BRANDNER, State University of Iowa. 

I. Introduction.

The problem of testing the significance of the difference between correlation
coefficients, $r_1$ and $r_2$, found in two independent samples of size $n_1$ and $n_2$ may be
considered as that of testing the hypothesis that the samples have been drawn
from populations in which the coefficients of correlation between the variables have
some common, but unspecified value, $\rho$. A method of procedure commonly used,
which is adequate if the samples are large and $\rho$ not too near either $+1$ or $-1$, is
to compare the difference, $r_1 - r_2$, with an estimate of its standard error, $\sigma_{r_1 - r_2}$.
But if these conditions are not satisfied, we are at once faced with certain
difficulties: (a) the value of $\sigma_{r_1 - r_2}$ is very sensitive to the particular estimate of $\rho$
chosen; (b) the sampling distribution of $r_1 - r_2$ will be asymmetrical, difficult to
calculate and again dependent on the estimate of $\rho$.

To meet this difficulty, R. A. Fisher has suggested the use of the
transformation*

$$z = \tfrac{1}{2}\{\log_e (1 + r) - \log_e (1 - r)\} \qquad (1),$$

because it then follows that if the two samples have been drawn from normal
populations with a common $\rho$, $z_1 - z_2$ will be distributed in repeated samples,
approximately normally about zero with a standard error given by

$$\sigma_{z_1 - z_2} = \sqrt{\frac{1}{n_1 - 3} + \frac{1}{n_2 - 3}} \qquad (2).$$

That is to say, by adopting this transformation, a test is obtained which is
both easy to apply and, but for a certain approximation†, completely independent
of the unknown value of $\rho$.
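The transformation and the standard error of the difference are simple to compute. The sketch below uses only the two relations just stated; the function names are illustrative and the sample correlations in the usage lines are hypothetical.

```python
import math


def fisher_z(r: float) -> float:
    """Transformation (1): z = (1/2) * {log(1 + r) - log(1 - r)}."""
    return 0.5 * (math.log(1.0 + r) - math.log(1.0 - r))


def se_z_difference(n1: int, n2: int) -> float:
    """Standard error (2) of z1 - z2 for two independent samples."""
    return math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))


# Hypothetical sample correlations and sample sizes.
r1, n1 = 0.60, 50
r2, n2 = 0.35, 40
ratio = (fisher_z(r1) - fisher_z(r2)) / se_z_difference(n1, n2)
print(f"z1 - z2 in units of its standard error: {ratio:.3f}")
```

Note that the standard error involves only $n_1$ and $n_2$, which is what frees the test from the unknown $\rho$.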

In a problem of this nature, it is evident that an indefinite number of criteria
might be found to use in testing the hypothesis of a common $\rho$. Owing to the
nearly invariant form of its sampling distribution, the criterion $z_1 - z_2$ might be
chosen on intuitive grounds as one of the most efficient (as well as most convenient
in application), but it is nevertheless of some interest to examine the logical basis

* Metron i. 4, pp. 12-18. Statistical Methods for Research Workers, Section 35.

† The nature of the approximation involved has been examined at various times. See for example
Biometrika xxi. pp. 357-360; Journal of the American Statistical Association, June 1932, pp. 127, 128.





for the choice between criteria. In a series of recent papers* J. Neyman and
E. S. Pearson have discussed how, when the hypothesis to be tested and the set
of admissible alternatives have been defined, the appropriate criterion may be
deduced from certain fundamental principles, without any preconceived notion
of what the form of the criterion ought to be. Making use of what has been
termed the likelihood ratio, they have shown how a number of existing tests and
certain new ones are brought into conformity, and my purpose is to consider the
application of this method to the present problem.


II. The Likelihood Ratio.

The problem may be stated as follows. Two samples, $\Sigma_1$ and $\Sigma_2$ of size $n_1$ and
$n_2$, have been randomly drawn from normally distributed bivariate populations,
$\Pi_1$ and $\Pi_2$ (variables $x$ and $y$). The two means, two standard deviations and
product-moment correlation coefficient are defined as follows:

For $\Pi_t$ ($t = 1, 2$),

$$a_t,\ a_t';\quad \sigma_t,\ \sigma_t';\quad \rho_t.$$

For $\Sigma_t$ ($t = 1, 2$),

$$\bar{x}_t,\ \bar{y}_t;\quad s_t,\ s_t';\quad r_t.$$

The admissible hypotheses concern the set $\Omega$ of all possible pairs of bivariate
normal populations. The hypothesis we shall test is not that $\Pi_1$ and $\Pi_2$ are
identical, but merely that their correlation coefficients have the same value,
or that

$$\rho_1 = \rho_2 = \rho \qquad (3),$$

while the relations

$$a_1 = a_2,\quad a_1' = a_2',\quad \sigma_1 = \sigma_2,\quad \sigma_1' = \sigma_2'$$

will not necessarily be satisfied. The population pairs ($\Pi_1$, $\Pi_2$) for which (3) is
true, form a subset $\omega$ of the set $\Omega$. The likelihood ratio, $\lambda$, is to be obtained by
choosing from $\Omega$ and $\omega$, respectively, the two population pairs which make the
chance of the observed sampling result a maximum.

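The chance of obtaining $\Sigma_1$ from $\Pi_1$ with character values falling in the ranges
($x_i \pm \tfrac{1}{2}h$, $y_i \pm \tfrac{1}{2}k$) ($i = 1, 2, \dots, n_1$) will be asymptotic to

$$C_1 = \left[\frac{1}{2\pi\sigma_1\sigma_1'\sqrt{1-\rho_1^2}}\right]^{n_1} \exp\left\{-\frac{1}{2(1-\rho_1^2)}\sum_{i=1}^{n_1}\left[\left(\frac{x_i-a_1}{\sigma_1}\right)^2 + \left(\frac{y_i-a_1'}{\sigma_1'}\right)^2 - 2\rho_1\frac{(x_i-a_1)(y_i-a_1')}{\sigma_1\sigma_1'}\right]\right\}(hk)^{n_1} \qquad (4),$$

as $h$ and $k$ approach zero.

Likewise for the sample $\Sigma_2$ of $n_2$ observations, the chance is given by

$$C_2 = \left[\frac{1}{2\pi\sigma_2\sigma_2'\sqrt{1-\rho_2^2}}\right]^{n_2} \exp\left\{-\frac{1}{2(1-\rho_2^2)}\sum_{i=1}^{n_2}\left[\left(\frac{x_i-a_2}{\sigma_2}\right)^2 + \left(\frac{y_i-a_2'}{\sigma_2'}\right)^2 - 2\rho_2\frac{(x_i-a_2)(y_i-a_2')}{\sigma_2\sigma_2'}\right]\right\}(hk)^{n_2} \qquad (5).$$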

* See Biometrika xxA. pp. 176, 264. Bulletin de l’Académie Polonaise des Sciences et des Lettres,
Série A, 1930, p. 73.



104 Significance of Difference in Correlation Coefficients 

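The combined probability of the occurrence is given by the product

$$C(\Omega) = C_1 C_2 \qquad (6).$$

If the values (4) and (5) are substituted in equation (6) and the sums simplified,
the result may be written

$$C(\Omega) = K\prod_{t=1}^{2}\left[\frac{1}{\sigma_t\sigma_t'\sqrt{1-\rho_t^2}}\right]^{n_t} \exp\left\{-\sum_{t=1}^{2}\frac{n_t}{2(1-\rho_t^2)}\left[\frac{(\bar{x}_t-a_t)^2+s_t^2}{\sigma_t^2}+\frac{(\bar{y}_t-a_t')^2+s_t'^2}{\sigma_t'^2}-2\rho_t\,\frac{(\bar{x}_t-a_t)(\bar{y}_t-a_t')+r_t s_t s_t'}{\sigma_t\sigma_t'}\right]\right\}(hk)^N \qquad (7),$$

where $N = n_1 + n_2$ and $K = (2\pi)^{-N}$.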

Finding the values of $a_t$, $a_t'$, etc. ($t = 1, 2$) to make $C(\Omega)$ a maximum gives

$$a_t = \bar{x}_t,\quad a_t' = \bar{y}_t,\quad \sigma_t = s_t,\quad \sigma_t' = s_t',\quad \rho_t = r_t \qquad (8).$$

This gives the pair of populations of maximum likelihood. By substituting 
relations (8) in (7) the chance of the joint occurrence becomes 


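$$C(\Omega_{\max}) = K\left[\frac{1}{s_1 s_1'\sqrt{1-r_1^2}}\right]^{n_1}\left[\frac{1}{s_2 s_2'\sqrt{1-r_2^2}}\right]^{n_2} e^{-N}(hk)^N \qquad (9).$$

Now consider the chance of the samples belonging to the subset $\omega$ defined
above. The total probability is given by

$$C(\omega) = K\prod_{t=1}^{2}\left[\frac{1}{\sigma_t\sigma_t'\sqrt{1-\rho^2}}\right]^{n_t} \exp\left\{-\frac{1}{2(1-\rho^2)}\sum_{t=1}^{2} n_t\left[\frac{(\bar{x}_t-a_t)^2+s_t^2}{\sigma_t^2}+\frac{(\bar{y}_t-a_t')^2+s_t'^2}{\sigma_t'^2}-2\rho\,\frac{(\bar{x}_t-a_t)(\bar{y}_t-a_t')+r_t s_t s_t'}{\sigma_t\sigma_t'}\right]\right\}(hk)^N \qquad (10).$$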


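Again, if $C(\omega)$ be maximized in respect to $a_t$, $a_t'$, etc. ($t = 1, 2$) it gives

$$\left.\begin{aligned} &\text{(A)}\quad a_t = \bar{x}_t,\quad a_t' = \bar{y}_t\\ &\text{(B)}\quad \frac{s_t^2}{\sigma_t^2} = \frac{s_t'^2}{\sigma_t'^2} = \frac{1-\rho^2}{1-r_t\rho}\\ &\text{(C)}\quad (n_1 r_2 + n_2 r_1)\rho^2 - N(1+r_1 r_2)\rho + n_1 r_1 + n_2 r_2 = 0 \end{aligned}\right\} \qquad (11).$$

Upon solving equation (C) of (11),

$$\rho = \frac{N(1+r_1r_2) - \sqrt{N^2(1-r_1r_2)^2 - 4n_1n_2(r_1-r_2)^2}}{2(n_1r_2+n_2r_1)} \qquad (12).$$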
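The root of the quadratic (C) is easily evaluated. The following sketch assumes (C) in the form $(n_1 r_2 + n_2 r_1)\rho^2 - N(1 + r_1 r_2)\rho + n_1 r_1 + n_2 r_2 = 0$ and takes the root with the negative radical; the function names, and the guard for a vanishing $\rho^2$ coefficient, are illustrative additions rather than part of the paper.

```python
import math


def rho_max_likelihood(r1: float, r2: float, n1: float, n2: float) -> float:
    """Root of (C) with the negative radical, i.e. the value given by (12)."""
    N = n1 + n2
    a = n1 * r2 + n2 * r1            # coefficient of rho^2
    b = N * (1.0 + r1 * r2)          # coefficient of rho, taken with a minus sign
    c = n1 * r1 + n2 * r2            # constant term
    if abs(a) < 1e-12:               # quadratic degenerates to a linear equation
        return c / b
    disc = N**2 * (1.0 - r1 * r2) ** 2 - 4.0 * n1 * n2 * (r1 - r2) ** 2
    return (b - math.sqrt(disc)) / (2.0 * a)


def equation_c_residual(rho, r1, r2, n1, n2):
    """Left-hand side of (C); zero at the maximum likelihood value."""
    N = n1 + n2
    return (n1 * r2 + n2 * r1) * rho**2 - N * (1 + r1 * r2) * rho + n1 * r1 + n2 * r2


# r1 = tanh 0.7 and r2 = tanh 0.2, with n1 = 2 * n2.
r1, r2 = math.tanh(0.7), math.tanh(0.2)
rho_m = rho_max_likelihood(r1, r2, 20, 10)
print(f"rho = {rho_m:.3f}, residual of (C) = {equation_c_residual(rho_m, r1, r2, 20, 10):.1e}")
```

The quantity under the radical cannot be negative, since $N^2 \ge 4 n_1 n_2$ and $(1 - r_1 r_2)^2 - (r_1 - r_2)^2 = (1 - r_1^2)(1 - r_2^2) \ge 0$.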


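This gives as a maximum chance

$$C(\omega_{\max}) = K\prod_{t=1}^{2}\left[\frac{\sqrt{1-\rho^2}}{s_t s_t'(1-r_t\rho)}\right]^{n_t} e^{-N}(hk)^N \qquad (13).$$

Thus we may obtain the likelihood ratio

$$\lambda = \frac{C(\omega_{\max})}{C(\Omega_{\max})} = \prod_{t=1}^{2}\left[\frac{\sqrt{(1-\rho^2)(1-r_t^2)}}{1-r_t\rho}\right]^{n_t} \qquad (14).$$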




III. The Case when $n_1 = n_2 = n$.

The above ratio cannot in general be expressed simply, but when

$$n_1 = n_2 = n \qquad (15),$$

by putting

$$r_1 = \tanh z_1,\quad r_2 = \tanh z_2,\quad \rho = \tanh z \qquad (16),$$

the result

$$z = \frac{z_1 + z_2}{2} \qquad (17)$$

is easily obtained from equation (11 C). Also, from equation (13),

$$C(\omega_{\max}) = K\left[\frac{\cosh z_1 \cosh z_2}{s_1 s_1' s_2 s_2' \cosh^2 \dfrac{z_1 - z_2}{2}}\right]^{n} e^{-N}(hk)^N \qquad (18).$$

If the same substitution is made in equation (9), it gives

$$C(\Omega_{\max}) = K\left[\frac{\cosh z_1 \cosh z_2}{s_1 s_1' s_2 s_2'}\right]^{n} e^{-N}(hk)^N \qquad (19).$$

Finding the ratio of the likelihoods and extracting the $(2n)$th root gives

$$\lambda^{\frac{1}{2n}} = \operatorname{sech}\frac{z_1 - z_2}{2} \qquad (20).$$

It follows that if the criterion used is

$$z_1 - z_2 = 2\operatorname{sech}^{-1}\lambda^{\frac{1}{2n}} \qquad (21),$$

it will be identical to that of R. A. Fisher referred to above. In other words, the
contours of constant $\lambda$ correspond exactly to the contours of constant values of
($z_1 - z_2$). As the likelihood of the hypothesis decreases, $z_1 - z_2$ increases, and the
method already described can be used to determine the significance of the
difference of the observed correlations of two normally correlated bivariate samples for
which nothing is known as to the values of the parameters involved.
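The identity (20) can be checked numerically. The sketch below assumes that the likelihood ratio, being the quotient of the two maximised chances, takes the form $\lambda = \prod_t [\sqrt{(1-\rho^2)(1-r_t^2)}/(1 - r_t\rho)]^{n_t}$; the sample size and the two values of $z$ are hypothetical.

```python
import math


def likelihood_ratio(rho, r1, r2, n1, n2):
    """Assumed form of lambda: product over the two samples of
    [sqrt((1 - rho^2)(1 - r_t^2)) / (1 - rho * r_t)] ** n_t."""
    lam = 1.0
    for r, n in ((r1, n1), (r2, n2)):
        lam *= (math.sqrt((1.0 - rho**2) * (1.0 - r**2)) / (1.0 - rho * r)) ** n
    return lam


n = 15                                   # hypothetical common sample size
z1, z2 = 0.9, 0.4                        # hypothetical transformed correlations
r1, r2 = math.tanh(z1), math.tanh(z2)
rho = math.tanh(0.5 * (z1 + z2))         # equation (17)

lhs = likelihood_ratio(rho, r1, r2, n, n) ** (1.0 / (2 * n))
rhs = 1.0 / math.cosh(0.5 * (z1 - z2))   # sech of half the difference
print(lhs, rhs)
```

The agreement follows because each factor reduces to $\operatorname{sech}(z - z_t)$ when $\rho = \tanh z$ and $r_t = \tanh z_t$.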


IV. The Case when $n_1 \neq n_2$.

The case of $n_1$ not equal to $n_2$ may now be considered. Without loss of
generality we may assume $r_1 > r_2$. In order to express $\lambda$ in simple terms it is here found
necessary to introduce an approximation for the value of $\rho$ to be inserted in (13).
Equation (11 C) may be written in the form

$$\left[\frac{N}{2}(r_1+r_2) - \frac{n_1-n_2}{2}(r_1-r_2)\right]\rho^2 - N(1+r_1r_2)\rho + \frac{N}{2}(r_1+r_2) + \frac{n_1-n_2}{2}(r_1-r_2) = 0 \qquad (22).$$

Then by substitutions (16),

$$[\tanh(z_1+z_2) - \epsilon]\rho^2 - 2\rho + [\tanh(z_1+z_2) + \epsilon] = 0 \qquad (23),$$

where

$$\epsilon = \frac{n_1-n_2}{N}\cdot\frac{r_1-r_2}{1+r_1r_2} \qquad (24).$$




From (23)

$$\begin{aligned}\rho &= \frac{1-\sqrt{\operatorname{sech}^2(z_1+z_2)+\epsilon^2}}{\tanh(z_1+z_2)-\epsilon}\\ &= \frac{1-\sqrt{1-[\tanh^2(z_1+z_2)-\epsilon^2]}}{\tanh(z_1+z_2)-\epsilon}\\ &= \frac{1-[1-\tfrac12\{\tanh^2(z_1+z_2)-\epsilon^2\}-\tfrac18\{\tanh^2(z_1+z_2)-\epsilon^2\}^2-\dots]}{\tanh(z_1+z_2)-\epsilon}\end{aligned} \qquad (25).$$

If all powers of $[\tanh^2(z_1+z_2)-\epsilon^2]$ greater than the second, which in most cases
will be small, are neglected in (25) the approximate value

$$\rho = \frac{\tfrac12[\tanh^2(z_1+z_2)-\epsilon^2]+\tfrac18[\tanh^2(z_1+z_2)-\epsilon^2]^2}{\tanh(z_1+z_2)-\epsilon} \qquad (26)$$

will result. The above approximation tends to slightly decrease the value for $\rho$.


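Thus

$$\rho = \frac{e^{2z}-1}{e^{2z}+1} = \frac{A}{2}+\frac{A^2B}{8} \qquad (27),$$

where

$$A = \tanh(z_1+z_2)+\epsilon \quad\text{and}\quad B = \tanh(z_1+z_2)-\epsilon \qquad (28).$$

Solving for $e^{2z}$ gives

$$e^{2z} = \frac{1+\dfrac{A}{2}+\dfrac{A^2B}{8}}{1-\dfrac{A}{2}-\dfrac{A^2B}{8}} \qquad (29),$$

from which

$$2z = \log\frac{1+\dfrac{A}{2}+\dfrac{A^2B}{8}}{1-\dfrac{A}{2}-\dfrac{A^2B}{8}},\qquad z = \left(\frac{A}{2}+\frac{A^2B}{8}\right)+\frac13\left(\frac{A}{2}+\frac{A^2B}{8}\right)^3+\frac15\left(\frac{A}{2}+\frac{A^2B}{8}\right)^5+\dots \qquad (30).$$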


Again, by neglecting all terms in $A$ and $B$ of the second order and higher powers,
another slight decrease will be made in the value for $\rho$. This gives the
approximation

$$z = \tfrac12[\tanh(z_1+z_2)+\epsilon] = \frac12\left[\frac{r_1+r_2}{1+r_1r_2}+\frac{n_1-n_2}{N}\cdot\frac{r_1-r_2}{1+r_1r_2}\right] = \frac{n_1r_1+n_2r_2}{N(1+r_1r_2)} \qquad (31).$$

These slight decreases in the value of $\rho$, and consequently the value of $z$ as noted
above, are almost exactly counterbalanced by again approximating in (31). By
expanding each of the values for $r_t$ ($t = 1, 2$) in terms of $z_t$, the numerator is slightly
increased and the denominator is decreased by discarding powers of $z_t$ of second
degree and higher. The value

$$z = \tanh^{-1}\rho = \frac{n_1z_1+n_2z_2}{N} \qquad (32)$$

is thus obtained as an approximation for the true value of $z$.
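The closeness of the approximation (32) to the exact root of (11 C) can be examined directly. The sketch below assumes the exact value in the form $\rho = [N(1+r_1r_2) - \sqrt{N^2(1-r_1r_2)^2 - 4n_1n_2(r_1-r_2)^2}]/[2(n_1r_2+n_2r_1)]$ and the approximation $\rho = \tanh[(n_1z_1+n_2z_2)/N]$; the function names are illustrative.

```python
import math


def rho_exact(z1, z2, n1, n2):
    """Maximum likelihood value from (12), with r_t = tanh z_t."""
    r1, r2 = math.tanh(z1), math.tanh(z2)
    N = n1 + n2
    a = n1 * r2 + n2 * r1
    b = N * (1.0 + r1 * r2)
    if abs(a) < 1e-12:                  # rho^2 term vanishes: linear equation
        return (n1 * r1 + n2 * r2) / b
    disc = N**2 * (1.0 - r1 * r2) ** 2 - 4.0 * n1 * n2 * (r1 - r2) ** 2
    return (b - math.sqrt(disc)) / (2.0 * a)


def rho_approx(z1, z2, n1, n2):
    """Approximation (32): tanh of the mean of z1, z2 weighted by n1, n2."""
    return math.tanh((n1 * z1 + n2 * z2) / (n1 + n2))


# Both values depend only on the ratio n1 : n2, so small integers suffice.
for n1, n2 in ((2, 1), (1, 2)):
    exact = rho_exact(0.7, 0.2, n1, n2)
    approx = rho_approx(0.7, 0.2, n1, n2)
    print(f"n1:n2 = {n1}:{n2}  exact {exact:.3f}  approximate {approx:.3f}")
```

With $z_1 = 0.7$, $z_2 = 0.2$ the approximate value falls slightly below the exact one when $n_1 > n_2$ and slightly above it when $n_1 < n_2$.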





The accuracy of this approximation for $\rho$ for widely varying values of $r_1$ and $r_2$
is shown in Tables I and II, where the exact maximum likelihood value from
equation (12) is denoted by $\rho_M$, and the value obtained from equation (32) by $\rho_a$.
Both $\rho_M$ and $\rho_a$ depend only on the ratio of $n_1$ to $n_2$, and the cases examined are
for $n_1 = 2n_2$ and $n_1 = \tfrac12 n_2$ (Table I), and $n_1 = 6n_2$ and $n_1 = \tfrac16 n_2$ (Table II), where we
have taken the higher of the two sample correlations as $r_1$. It will be noticed that
when $n_1 > n_2$, $\rho_a \le \rho_M$ and when $n_1 < n_2$, $\rho_a \ge \rho_M$. This shows that the sample
containing more observations carries greater weight in determining the maximum
likelihood $z$, than is allowed for in the weighted arithmetic mean of $z_1$ and $z_2$.

Needless to say for either $n_1 = n_2$ or $r_1 = r_2$, $\rho_M$ and $\rho_a$ will agree exactly.


TABLE I.

                                Case n_1 = 2n_2               Case n_1 = ½n_2
                                -------------------------     -------------------------
 z_1    z_2    r_1      r_2     ρ_M    ρ_a    ρ'(n_1=20)      ρ_M    ρ_a    ρ'(n_1=10)     (z_1−z_2)/σ_{z_1−z_2}

 0.2    0.0    .1974    .0000   .133   .133   .141            .067   .067   .058           0.45
 0.4    0.0    .3800    .0000   .262   .261   .276            .131   .133   .116           0.89
 0.7    0.0    .6044    .0000   .442   .436   .459            .221   .229   .201           1.56
 0.4    0.2    .3800    .1974   .322   .322   .329            .260   .261   .253           0.45
 0.7    0.2    .6044    .1974   .490   .488   .504            .348   .351   .334           1.11
 1.1    0.2    .8005    .1974   .674   .664   .684            .448   .462   .432           2.00
 1.5    0.2    .9052    .1974   .808   .788   .808            .523   .560   .522           2.89
 0.7    0.4    .6044    .3800   .538   .537   .546            .462   .462   .452           0.67
 1.1    0.4    .8005    .3800   .704   .700   .714            .555   .560   .540           1.56
 1.5    0.4    .9052    .3800   .823   .812   .827            .625   .645   .617           2.45
 2.3    0.4    .9801    .3800   .949   .931   .941            .706   .775   .742           4.23
 1.1    0.7    .8005    .6044   .748   .747   .755            .681   .682   .673           0.89
 1.5    0.7    .9052    .6044   .847   .844   .853            .741   .747   .732           1.78
 2.3    0.7    .9801    .6044   .953   .943   .950            .814   .844   .823           3.56
 1.5    1.1    .9052    .8005   .878   .878   .882            .843   .844   .839           0.89
 2.3    1.1    .9801    .8005   .960   .956   .960            .898   .905   .896           2.67
 2.3    1.5    .9801    .9052   .967   .966   .968            .942   .943   .939           1.78


It should be noted that the estimate of $\rho$ suggested by R. A. Fisher* is based
on a weighting of $z_1$ and $z_2$ inversely proportional to the approximate values of
their sampling variances, namely

$$\rho' = \tanh\left[\frac{(n_1-3)z_1+(n_2-3)z_2}{N-6}\right] \qquad (33).$$

This result does not correspond either to the maximum likelihood value of (12)
or to the approximation (32). $\rho'$ will depend upon the actual values of $n_1$ and $n_2$,
and has been computed for the cases

$$\left.\begin{array}{l} n_1 = 20,\ n_2 = 10;\ \text{and}\ n_1 = 10,\ n_2 = 20\ \text{(Table I)}\\ n_1 = 60,\ n_2 = 10;\ \text{and}\ n_1 = 10,\ n_2 = 60\ \text{(Table II)}\end{array}\right\} \qquad (34).$$
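Unlike the weighting by $n_1$ and $n_2$, this estimate depends on the actual sample sizes. The sketch below assumes the estimate in the form $\rho' = \tanh[\{(n_1-3)z_1 + (n_2-3)z_2\}/(N-6)]$; the function name is illustrative, and the large sample sizes in the last line are hypothetical, chosen only to show the two weightings approaching one another.

```python
import math


def rho_fisher_weighted(z1, z2, n1, n2):
    """Estimate (33): z1 and z2 weighted by (n1 - 3) and (n2 - 3)."""
    z = ((n1 - 3) * z1 + (n2 - 3) * z2) / (n1 + n2 - 6)
    return math.tanh(z)


z1, z2 = 0.7, 0.2
print(f"n1 = 20, n2 = 10: {rho_fisher_weighted(z1, z2, 20, 10):.3f}")
print(f"n1 = 60, n2 = 10: {rho_fisher_weighted(z1, z2, 60, 10):.3f}")

# With the ratio n1 : n2 held at 2 : 1 and the samples made large, the
# weights (n - 3) approach the weights n of the simple weighted mean.
large = rho_fisher_weighted(z1, z2, 2000, 1000)
simple = math.tanh((2 * z1 + z2) / 3)
print(f"large samples: {large:.4f}  weighting by n1, n2: {simple:.4f}")
```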


* Metron i. 4, p. 18.






TABLE II.

                                Case n_1 = 6n_2                    Case n_1 = ⅙n_2
                                ------------------------------     ----------------------------------------
 z_1    z_2    r_1      r_2     ρ_M    ρ_a           ρ'(n_1=60)    ρ_M           ρ_a           ρ'(n_1=10)     (z_1−z_2)/σ_{z_1−z_2}

 0.2    0.0    .1974    .0000   .170   .170          .176          .028          [illegible]   .022           0.49
 0.7    0.0    .6044    .0000   .544   .537          .554          [illegible]   .100          .076           1.75
 0.7    0.2    .6044    .1974   .559   .557          .568          .262          .265          .249           1.25
 1.5    0.2    .9052    .1974   .877   .865          .876          .326          .368          .329           3.25
 1.1    0.4    .8005    .3800   .765   .762          .771          .455          .462          .443           1.75
 2.3    0.4    .9801    .3800   .973   [illegible]   .970          .507          .586          [illegible]    4.74
 1.5    0.7    .9052    .6044   .885   .882          .888          .665          .672          .657           2.00
 2.3    1.1    .9801    .8005   .974   .972          .974          .843          .854          .843           3.00


It will be seen that, although $\rho_a$ is generally a better approximation to $\rho_M$ than is
$\rho'$ (for the range of cases taken), this is not always so; some light is thrown on the
position by calculating the ratio

$$\frac{z_1-z_2}{\sigma_{z_1-z_2}} = \frac{z_1-z_2}{\sqrt{\dfrac{1}{n_1-3}+\dfrac{1}{n_2-3}}} \qquad (35)$$

for the values of $n_1$ and $n_2$ given in (34). These ratios are entered in the last column
of each table, and provide a measure of the significance of the differences between
the pairs of sample values $r_1$ and $r_2$. It will be seen that $\rho_a$ is closer to $\rho_M$ than is
$\rho'$ so long as $(z_1 - z_2)/\sigma_{z_1-z_2} < 2.00$, but that when the ratio exceeds 2.00, the
position is reversed. Of course, as $n_1$ and $n_2$ increase, $\rho' \to \rho_a$; but the approach of $\rho_a$
to $\rho_M$ will depend on the ratio of $n_1$ to $n_2$.


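Finally, if the approximation (32) be accepted as adequate, and $\rho_a$ substituted
for $\rho_M$ in (13), we obtain from this equation, and from (9),

$$C(\Omega_{\max}) = K\left[\frac{\cosh z_1}{s_1 s_1'}\right]^{n_1}\left[\frac{\cosh z_2}{s_2 s_2'}\right]^{n_2} e^{-N}(hk)^N \qquad (36)$$

and

$$C(\omega_{\max}) = K\left[\frac{\cosh z_1}{s_1 s_1' \cosh(z_1 - z)}\right]^{n_1}\left[\frac{\cosh z_2}{s_2 s_2' \cosh(z_2 - z)}\right]^{n_2} e^{-N}(hk)^N \qquad (37).$$

Taking the ratio of these likelihoods gives

$$\begin{aligned}\lambda = \frac{C(\omega_{\max})}{C(\Omega_{\max})} &= [\operatorname{sech}(z_1 - z)]^{n_1}[\operatorname{sech}(z_2 - z)]^{n_2}\\ &= \left[\operatorname{sech}\frac{n_2}{N}(z_1 - z_2)\right]^{n_1}\left[\operatorname{sech}\frac{n_1}{N}(z_1 - z_2)\right]^{n_2}\end{aligned} \qquad (38).$$

Thus with approximation the contours of constant $\lambda$ again agree with R. A.
Fisher’s contours for ($z_1 - z_2$).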

The following example will illustrate the use of the test. Suppose

$$n_1 = 60,\ n_2 = 10;\quad r_1 = .6044,\ r_2 = .1974 \quad\text{(see Table II)}.$$

Here

$$z_1 = 0.7,\quad z_2 = 0.2$$

and

$$(z_1 - z_2)/\sigma_{z_1-z_2} = (0.7 - 0.2)\Big/\sqrt{\tfrac{1}{57}+\tfrac{1}{7}} = 1.25.$$

If we refer this ratio to the normal probability scale we find $\tfrac12(1 + \alpha) = .894$.

We may now reason as follows: since the alternative to the hypothesis tested
($\rho_1 = \rho_2 = \rho$) which naturally demands our first attention is that $\rho_1 > \rho_2$, we ask
what is the chance that $z_1$ would exceed $z_2$ by 0.5 or more, were $\rho_1 = \rho_2$? This
chance is $\tfrac12(1 - \alpha) = .106$, and we should conclude that there was no clear call to
reject the hypothesis, $\rho_1 = \rho_2$, in favour of an alternative $\rho_1 > \rho_2$. Evidently there
would be far less reason still to reject it in favour of some alternative, $\rho_1 < \rho_2$.
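The arithmetic of this example can be retraced in a few lines. The sketch below takes the figures of the example ($n_1 = 60$, $n_2 = 10$, $z_1 = 0.7$, $z_2 = 0.2$) and evaluates the normal probability integral through the error function of the standard library; the helper name `normal_cdf` is an illustrative choice.

```python
import math


def normal_cdf(x: float) -> float:
    """Standard normal probability integral, via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


n1, n2 = 60, 10
z1, z2 = 0.7, 0.2

se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))   # standard error of z1 - z2
ratio = (z1 - z2) / se
lower = normal_cdf(ratio)                          # (1/2)(1 + alpha)
upper_tail = 1.0 - lower                           # (1/2)(1 - alpha)

print(f"ratio = {ratio:.2f}")
print(f"(1/2)(1 + alpha) = {lower:.3f},  (1/2)(1 - alpha) = {upper_tail:.3f}")
```

The upper tail is the chance, on the hypothesis of a common $\rho$, that $z_1$ exceeds $z_2$ by 0.5 or more.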

V. Conclusion.

The problem discussed is that of testing the hypothesis that two samples have
been drawn from populations in which the coefficient of correlation has some
common but unspecified value.

It is assumed that the variation in the populations sampled is normal. Following
the method of Pearson and Neyman for testing what they have termed a composite
hypothesis*, a criterion, $\lambda$, has been obtained which for $n_1 = n_2$ is exactly, and for
other cases closely, related to the criterion ($z_1 - z_2$) suggested by R. A. Fisher.
This result is analogous to others which have been reached by the two writers in
the papers referred to above, in so far as it shows that by employing the method
of likelihood to determine the appropriate criterion to use in testing a statistical
hypothesis, we are led to certain standard tests. In this particular case Fisher’s
z-test was first reached from a quite different line of approach.

* It may of course be argued that a hypothesis of this type cannot be tested by calculation of a
single criterion with single probability measure, and that we should determine separately the significance
of the differences between $r_1$ and $\rho_M$, and $r_2$ and $\rho_M$.



PLURAL BIRTHS WITH A NEW PEDIGREE. 


By JULIA BELL, M.A., M.R.C.P. 

(1) Some two years ago, Dr E. A. Barton, working in connection with the Obstetric 
Department of University College Hospital, came across an interesting history of 
plural births in three generations of a family, transmitted, in the particular case 
under his observation, through a male, III. 10, who was believed himself to have 
been a single birth. Mr Herbert Spencer warns us* that statistics of patients 
treated in lying-in-wards are alone reliable in respect of multiple births, as one of 
a twinship often dies and the fact of a twin pregnancy may not be mentioned to 
the patient. However, their history of multiple births has been of interest, not
unaccompanied by anxiety, in the family whose pedigree is given under Fig. 1, 
and so far as his mother and his grandmother were aware, III. 10 was born at a 
single birth. 

It will be seen from Fig. 1 of the adjoining plate that one quadruplet, probably 
two triplets and seven pairs of twins have been born in five sibships. 

Dr Barton kindly sent the mother, III. 3, to see us after the birth of her 
second pair of twins, IV. 12 and 13 ; she is an intelligent woman, with no history 
of twins or multiple births in her own family ; she and her husband provided most 
of the facts shown in the pedigree. There was some difference of opinion in the 
family regarding the triplet of III. 18-20; the mother, II. 12, believing she had
no triplet, whilst her mother, I. 4, declared that this triplet was born in Middlesex 
Hospital, and resulted from a shock to the mother on hearing that her elder son, 
III. 10, was to have a diseased bone removed from his face. This experience fully 
supports Mr Herbert Spencer’s warning. Dr E. A. Cockayne kindly obtained
verification of the fact from the Registrar of Middlesex Hospital. It is of interest 
to note that though both parents, II. 11 and 12, had siblings belonging to multiple 
births, they themselves apparently produced eight children at single births in 
addition to two pairs of twins and one triplet. There is some uncertainty about 
the triplet II. 4—6 ; its occurrence is a family tradition, but nothing is now known 
regarding the sex of the children or whether any of them survived birth. 

II. 12, who is still living, is sure that her mother had only one multiple birth 
in a large family; this consisted in a quadruplet of four boys named Matthew, 
Mark, Luke and John ; all the boys lived for some time, but died before the age 
of 20 years. Of the fifteen children of II. 11 and 12, only four are now living ; the 


* In a personal letter to Professor Karl Pearson.





triplet of this sibship was born prematurely and all its members died. One brother
of this family, II. 21, was killed in the war. One twin sister, III. 14, is still living; 
she is married and has had two confinements, with three daughters ; one of the 
twin daughters died young, of diphtheria. 

III. 3, the wife of III. 10, has had six children at four confinements; each pair 
of twins were of like sex, but their mother says the children are not at all alike ; 
one of the first pair of twins died aged 13 months from pneumonia. 

III. 3 is a young woman, aged 30 at the birth of her last twins ; she suffers 
from asthma, and, with a husband unemployed for nearly two years, considers 
anxiously the possibility of further multiple births ; she remarked that she had 
indeed married into a dangerous family. 

(2) Remarkable examples of multiple births have been reported from time to time 
since the days of Aristotle. These include : (a) Individual cases of large numbers 
of children born at a single birth. (b) Histories of successive plural births to the
same individual ; of some special interest are a number of histories, some of them 
undoubtedly authentic, recording a sequence to the same individual of the same 
type of multiple birth, the mother producing, say, always twins, or always triplets, 
(c) Family histories recording plural births in a number of individuals of the 
same stock; the more interesting of these refer to cases in which the liability 
to the occurrence is transmitted through the male, as may be seen in our Fig. 1; 
also in Figs. 2 and 3. 

(a) Writing in 1850 ( Notes and Queries , p. 459), Richard Owen says that the 
largest number at a birth of which any authentic record appears is five ; there 
was at this time, in the Royal College of Surgeons, a specimen jar containing five 
foetuses from one birth, of which three were still-born; the two born alive survived
for a short time only*. 

Garthshore, writing in the Phil. Trans. 1787†, recorded the occurrence of this
particular case; he states that he had employed various friends at Petersburg,
Berlin, Vienna, Lyons, Paris and Ghent to collect for him well authenticated
cases of this kind, and that he had not yet been able to procure any. Garthshore
collected and published at that date a number of records of multiple births, but
expressed the opinion that, when we advance further than five at a birth, we get
into the region of tradition and improbability.

Foy‡, who in 1890 again collected all records he could find of plural births,
including quadruplets and upwards, expresses the opinion that we may err by an
excess of incredulity, and that we need to be very cautious before rejecting definite
statements made by medical men recording their personal experiences.

* The specimen may still be seen in the Museum of the College, together with several more recent
examples.

† Bibl. No. 7.

‡ Bibl. No. 12.




Ambroise Paré* expresses no doubt in quoting the statement of Martin
Cromer, the Polish historian, that “one Margaret, a woman sprung from a noble
and ancient familie near Cracovia, and wife to Count Verboslaus, brought forth
at one birth thirtie five live children†, upon the twentieth daie of January in the
year 1296.” Paré was a contemporary of the historian and presumably might have
satisfied himself of the evidence on which the statement was made. We also learn
from this distinguished surgeon that “Franciscus Picus Mirandula writeth that
one Dorothie, an Italian, had twentie children at two births; at the first nine, and
at the second eleven, and that shee was so big that shee was forced to bear up her
bellie, which laie upon her knees, with a broad and large scarf tied about her neck,
as you may see by this figure”; an illustration of the said Dorothy is given,
with her scarf in position. At a later date, in 1684, Dr Seignette writes of an
“Accouchement Surprenant” which he had seen at Rochefort, where “une femme
de Xaintonge étoit accouchée de neuf enfans, tous bien formés, et ausquels on
distinguoit le sexe; et que cette même femme l’année précédente avoit accouché
de onze‡.” It is difficult to believe in or to deny the possibility of any of these
occurrences.

Was Martin Cromer (1512 — 1589) a reliable historian? He reports that 
Mathias Golancevius, Bishop of Vladislavia in Poland, was the only survivor 
of twelve sons delivered at one birth, the rest dying as soon as they were born§. 
The name here recalls that of Margarita Gonzalez of Valencia, whose remarkable 
fecundity was noted by Henrique Cock when travelling in Spain with Philip II
in 1585. “The midwives and several doctors of Valencia are witnesses, counting
the children that her two husbands had the good fortune to have. They found
one hundred and forty-four males and fourteen females. Amongst them there
were baptised forty-nine sons and three females||.” This case is obviously either
a fabrication, or some misprint has occurred, or the midwives and doctors were
unable to count, for we are told that this Margarita had thirty-three parturitions 
between the ages of 15 years and 35 ; the woman, however, may have provided 
an example of remarkable fecundity, and my attention was drawn to it by the 
similarity of the name to that of the Polish report. Margarita Gonsalez was said 
to be the daughter of a Basque father and a Parisian mother ; she married first 
a Neapolitan and secondly a Basque, so was cosmopolitan with regard to her 
connections. She was again pregnant when the travellers left Valencia. 

If Ambroise Paré was ready to be credulous, Owen and Garthshore would
appear to have understated the case. There are certainly numerous authentic 
accounts of six at a birth, and there is what surely one must accept as definite 
evidence of a septuplet, from a memorial stone on a house at Hameln a. Weser ; 

* Bibl. No. 1.

† In editions of Cromer’s work, published in German in 1562 and in Latin in 1589, I find the number
given as thirty-six live children. The oft repeated story of the Gräfin von Henneberg and her 365
children at a birth is undoubtedly a myth.

‡ Bibl. No. 6. § Bibl. No. 2, p. 197.

|| Bibl. No. 8.





kneeling parents and seven babies in swaddling clothes are represented on the 
stone below a crucifix, and the following inscription is given : 

Allhier ein Bürger Thiele Römer genannt
Seine Hausfrau Anna Breyers wohlbekannt.

Als man zählte 1600 Jahr
Den 9 Januarius des Morgens 3 Uhr war
Von ihr zwei Knäbelein und fünf Mägdelein
Auf eine Zeit geboren sein.

Haben auch die heiligen Tauf erworben
Folgends den 20ten 12 Uhr seelig gestorben.

Gott wolle ihn [-en] geben die Seeligkeit,

Die allen Gläubigen ist bereit.

Obiges original Denkmal hat durch die Güte des Herrn Bürgermeister Domeier, der jetzige
Besitzer dieses damahls Römerschen Hauses Gerichtsschreiber Hoppe wieder erhalten und
aufgestellt im Jahr 1818.

A photograph of this stone is given by Barfurth*, whose account, in 1895, is 
the first reference to it which is known to me. We must conclude that seven at 
one birth is the greatest number of which we have an authentic account, but we 
have no reason to believe that this number has never been exceeded. 

(b) With regard to successive plural births in the same individual, the most 
astonishing report is one whose first account I find in The Gentleman’s Magazine
(Vol. LIII. p. 753, London, 1783): “In an original letter now before me, dated
St Petersburg, Aug. 13, 1782, O.S., Feodor Wassilief, aged 75, a peasant, said to
be now alive and in perfect health, in the Government of Moscow, has had:

By his first wife:                  By his second wife:

 4 x 4 = 16                          6 x 2 = 12

 7 x 3 = 21                          2 x 3 =  6

16 x 2 = 32                          8 births 18 children.

27 births 69 children.

In all 35 births, 87 children, of which 84 are living and only three buried.... The 
above relation, however astonishing, may be depended upon, as it came directly 
from an English merchant at St Petersburg to his relatives in England, who added 
that the peasant was to be introduced to the Empress.” 

This history was published independently, with a caution, by Hermann†,
writing on Statistics of the Russian population in 1790. My impulse was to
reject the case as unworthy of serious consideration, as apparently the cautious
Garthshore did in 1787. However, from a statement in the Lancet of 1878 (Vol. i.
p. 290), we learn that a few years earlier the French Academy of Science had
endeavoured to obtain verification of the occurrence; they appealed to M. Khanikoff
of the Imperial Academy of St Petersburg for advice as to the means they
should pursue, but were told by him that all investigation was superfluous, that
members of the family still lived in Moscow and that they had been the object of
favours from the Government. Are we to accept this case as an established record?

* Bibl. No. 14. † Bibl. No. 8.



What then of the following report *: “In the year 1755 a Muscovite peasant, 
named James Kyrloff, and his wife were presented to the Empress of Russia. 
This peasant had been twice married and was then 70 years of age. His first wife 
was brought to bed twenty-one times, four times with four children each time, 
seven times of three, and ten times of two, making in all fifty-seven children who 
were then alive. His second wife, who accompanied him, had already been delivered 
seven times, once of three children, six times of twins.” Surely both these Russian 
cases must be regarded as under suspicion; other similar cases have been reported, 
one of which referred to a handworker in Lille who had 82 children by two wives. 
There is, however, this point of interest about all these and other reports — they 
may exaggerate the details, but they each suggest the probable occurrence of a 
rather remarkable sequence of multiple births in the two wives of one man, and 
thus indicate in these cases examples of the probably inherited liability to produce 
multiple births, transmitted by the male. 

At a much earlier date Aristotle writes (History of Animals, Lib. vii. Cap. 4):
“As a rule and in most countries women have but one child at a birth; yet
frequently and in many districts they bear twins, particularly in the land of Egypt.
But even three and four occur at a birth, and this quite frequently in certain
places as it has been stated above. A woman does not bear more than five at a
birth, and this has been observed to happen several times. Indeed a certain woman 
bore twenty children in four parturitions, five at each, and most of them were 
reared.” This cannot be regarded as proven evidence of the occurrence of a sequence 
of four quintuplets, but the reference is of great historic interest, and there is 
some measure of probability regarding its accuracy. 

To return now to Ambroise Paré†. He writes: “In our time, between Sarte
and Main, in the parish of Seaux, not far from Chambellay, there is a familie and
noble hous called Maldemeure ; the wife of the Lord of Maldemeure, the first year
shee was married brought forth twins, the second year shee had three children, the
third year four, the fourth year five, the fifth year six, and of that birth she died ;
of those six one is yet alive, and is Lord of Maldemeure.” We must receive with
caution Paré's recital of cases taken from other writers, but this statement
regarding a noble family in his own country, which could be refuted any day if
untrue, cannot easily be rejected. It is of some interest to note the increasing 
number recorded at each birth in this history, in view of the fact that there is a 
good deal of evidence in favour of the statement that on the whole plural births 
tend to occur later in life than single births ; the average age of the mother at the
birth of twins being older than that at the age of single births, and the average 
age of the mother at the birth of triplets being greater than that at the age of 
twins. So that mistrust of the case, which has been suggested on the grounds 
of the improbability of the regularly increasing number at birth, loses perhaps 
some of its justification. 


* Bibl. No. 5. † Bibl. No. 1.



Julia Bell 




None of these remarkable cases really carries conviction; we cannot refuse to 
believe in the possibility of their occurrence, but if we were able to obtain verifi- 
cation of one extreme case, other accounts would assuredly receive a measure 
of support. I have tried unsuccessfully to get into touch with a more recently 
published case; in 1886 the Naples correspondent of the Paris Register writes*: 
“About twenty-five miles from here, and by rail two or three stations beyond
Pompeii, is the historical city of Nocera. In it lives Maddalena Granata, aged 47,
who was married at the age of 28 to a peasant just nineteen years ago. Maddalena 
Granata has given birth to, either dead or living, fifty-two children, forty-nine of 
whom were males. She enjoys florid health, is robust, and twenty-four hours after 
her last accouchement was ready to go out to her accustomed labour in the field.... 
Her physician, Dr de Sanctis of Nocera, says that there is not the least exagge- 
ration in these statements.... She has had triplets fifteen times.” It seems almost 
incredible that anybody should have triplets fifteen times. There should be no 
difficulty in finding out whether there is any truth in this report ; a letter to the 
village Syndicate has brought no reply ; probably a visit to Nocera, by a traveller 
in those parts, would be the most satisfactory way of getting into touch with a 
descendant of Maddalena Granata or of Dr de Sanctis. 

A similar history, published in the Gazette Médicale de Lyons (Oct. 1, 1863)†,
reports that the wife of a medical man at Fuentemajor in Spain, aged 43, had just 
been delivered of three girls ; it was the thirteenth time she had been confined of 
triplets. 

A number of authentic cases of a sequence of twin births are on record ; the 
mother of the famous Dr Lettsom had twins seven times, all of whom were males‡;
Dr Lettsom and his twin brother were the last children borne by her, and were
the only two who survived. A patient of Mr Herbert Spencer's told him that she
herself had had six pairs of twins and no other pregnancy. 

A history due to Peiper (Fig. 2 of our pedigree plate) shows a sequence of nine 
twin births; each pair of twins were of unlike sex. Is there a tendency for a sequence 
confined to one type of plural birth to be uniform with regard to sex ? The only 
case of such a sequence known to me refers to three triplets consisting of nine 
boys; there was an interval of 18 months between each birth; all the boys are 
living. Uniformity of the male sex may well have occurred in the triplets of
Maddalena Granata, if we can trust that history, though she did not have exclu- 
sively triplets and we are only told that forty-nine of her fifty-two children were 
males. 

An epitaph of some interest is referred to by Hakewill§ in 1635: “Neither 
can I call to minde any example in all antiquity parallel to that of a woman buried 
in the Church at Dunstable who (as her epitaph testifies) bore at three severall 
times three children at a birth and five at a birth two other times.” This epitaph 

* See Medical Press and Circular, Vol. i. for 1886, p. 57. London, 1886.

† Reference in The Lancet, Vol. ii. for 1868, p. 466. London, 1868.

‡ Bibl. No. 9. § Bibl. No. 4.




is given by Francis Thynne in his collection (Cleopatra, c. 3, p. 114) from the MS. 
in the Cottonian Library; it was copied by him in Sept. 1583, and is given in 
the appendix to Hearne’s 1733 edition of Chronicon sive Annales Prioratus de 
Dunstable. Several reproductions have been published with some variations in
spelling. Francis Thynne’s manuscript is not easy to decipher by the unskilled, 
and we are greatly indebted to Professor H. E. Butler for copying the epitaph 
from the script for us ; it runs as follows : 

Hic William Mulso sibi quam sociauit et Alice
Marmore sub duro conclusit nex(?) generalis
Ter tres bis quinos natos haec fertur habere
Per sponsos binos, Deus his clemens miserere.

We are also much indebted to Mr T. W. Bagshawe, of the Dunstable Library
and Museum, for a description of the memorial stone (taken from Derbyshire’s 
History of Dunstable, 1872). “In the middle aisle, opposite the pulpit is a large 
slab, beneath which is buried a woman who had nineteen children at five births ; 
viz. three several times three children at a birth, and twice five at other times. It is 
not strange that this account is frequently disbelieved ;...but if tradition the most 
straightforward can be accredited, it is a literal fact. Upon the slab were the 
figures of a man and woman in brass, both dressed in gowns, with their hands in 
the attitude of prayer, at their feet was the inscription. Beneath the latter were 
two groups, one of boys and the other of girls, with the types of the evangelists at 
the corners.” Derbyshire, writing in the nineteenth century, quotes the inscription
from Hearne’s edition of the Chronicles; he does not state where he found the 
description of the slab, which is not, so far as I have discovered, to be found in 
Hearne’s volumes. 

Hakewill’s interpretation of the epitaph has been repeatedly quoted ; indeed,
I must confess to having myself been ready to present the case as an authentic
account of a sequence of three triplets and two quintuplets. Professor Butler, 
however, is of the opinion that the epitaph by no means justifies this conclusion ; 
he considers that all we can accept from the epitaph is that Alice had nineteen 
children by two husbands, nineteen being expressed as “Ter tres bis quinos” to 
meet the needs of the verse. The date of the stone is not given, all we know is 
that it was prior to 1583. Possibly further information was available to Hakewill ; 
in the absence of any reference to it we must reject the case from consideration 
as an undoubted example, which would have been of interest for our purposes, and 
express gratitude to Professor Butler for his caution. 

(c) I will now add a few illustrative examples showing the occurrence of plural
births in the same family ; these cases are all relatively recent and the histories 
are probably accurate as far as they go. Peiper, in 1923*, Fig. 2, publishes an 
extremely interesting history of a woman, II. 4, married to a man who had a twin 
sister; they had a sequence of nine pairs of twins; each pair included a male and 
a female ; all the eighteen children died, and one is reminded of the statements of 


* Bibl. No. 17.







Aristotle* and Pliny regarding the high mortality of twins of unlike sex — such 
a pedigree as this in Aristotle’s day might readily lead to a tradition, which of 
course would have no general application. It is however of interest to remember the 
mortality of Dr Lettsom’s sibship. The mother of the twins in Peiper’s case had 
nine siblings and so far as was known no twins had been born in her family ; this
woman married a second husband and had six single births, four daughters and
two sons; the first of these children, III. 19, died aged three months; III. 20—23
all died at birth ; the last child, III. 24, delivered by Caesarian section, was the 
only one of the mother’s twenty-four children who remained alive; moreover III. 24 
was not without difficulties as he had to be operated on for spasm of the pylorus. 
The interesting points of this pedigree are the marked inheritance of twinning, 
through the father ; also the long sequence of twins of unlike sex, without any 
triplets or single births. The high mortality rate would appear probably to be due 
to the mother rather than to her twins, since five of her six single births by 
a second husband shared the same fate. 

Another interesting small pedigree of twinning, due to Strassmann†, is shown
in Fig. 3 ; it includes six pairs of twins of whom four pairs were known to be of 
like sex, one pair included both sexes ; four of the cases had arisen through the 
father. The only female of the family to be married, II. 4, was herself a twin ; she 
had only single births ; the male twin of II. 4 was twice married and had twin 
children by both wives ; the only male of a single birth to marry had twin offspring. 
No triplets or higher multiple births are noted to have occurred in this family. 
Fig. 4, due to Grigg‡, provides a marked contrast to the two previous cases in that
it presents a history of multiple births including twins and triplets, transmitted 
through the female in every case, so far as we can judge. The family history is 
very incomplete ; it was given to the recorder by his patient, IV. 7 ; she said 
that her mother, III. 7, had two brothers who were married, neither of them had 
any children ; we are not told whether these brothers belonged to the triplet of
that generation, or were twins. Did these brothers abstain from parenthood, for 
perhaps economic reasons, or were they infertile on physiological grounds ? Apart 
from this reference we have no knowledge of any male of the family having married, 
nor are we told how many males had been born in the various sibships. IV. 7, 
aged 44 at the time of observation, had had sixteen children, triplets twice with 
ten single births; she had seven daughters living; we are not told whether the
remaining nine children were males or females or at what age they died. The 
eldest daughter of IV. 7 had four children, of whom three were born at one birth ; 
the second daughter of IV. 7 was also married and was pregnant at the time of the 
record. A maternal great-aunt of IV. 7, aged 90 and still living, said that her 
grandmother had told her that triplets had occurred in the family as far back as 
any record could be obtained. We have tried to get in touch with this exceedingly 
interesting family, but have not succeeded in doing so. It is very unfortunate that 

* De Animalibus, Liber vii. Cap. iv. Aristotle, after stating that mixed sexes in a litter do not
affect surviving in the case of animals, adds: “but in the case of men few survive of twins if one is
female and the other male.” † Bibl. No. 16. ‡ Bibl. No. 13.




the only two males of the family of whom we have any record married, but had no 
children. 

Three further cases showing the liability to multiple births transmitted, so far 
as we can judge, through the mother seem worth putting on record. The Lancet 
(Vol. I. for 1889, p. 392) reports a case which was communicated to the Lisbon 
Medical Society by Senor Pereira da Cruz, of Aveiro. A woman had had four 
confinements in eight years ; first, twins were still-born or died soon after birth ; 
the following year a triplet, including two boys and one girl, was born ; five years 
later a quintuplet was born, of which the first child lived 50 days, the second 
lived 28 hours, the remaining three were still-born ; two years later the woman 
had a single birth of a still-born child. The mother of this sibship had two sisters 
and an aunt who were said to have been the subjects of similar multiple preg- 
nancies ; there is no mention of plural births among the children of any male of 
the family, nor are we told of any plural births in the mother’s sibship; no 
information is given as to whether the aunt who had plural births was on the 
maternal or on the paternal side. The case is thus very imperfectly described, but 
the history of the quintuplet in association with other plural births, of the heavy 
mortality and of the similar history on the mother’s side of the family provide 
positive information of considerable interest. In examining this case and the 
following we must remember Mr Spencer’s warning that when one member of 
a twinship dies the fact often remains unrecorded ; we should not perhaps regard 
it as demonstrated that the mother of either of these sibships was certainly born 
alone. 

In 1905, Dr Roberts, of New South Wales, described a case of his experience*.
A woman, I. 2 of Fig. 6, aged 32, had been married twelve years; she had a family
history of twinning on her maternal side; she had a personal history of seven
confinements at the first of which twins were delivered, at the second a triplet
which was born prematurely at seven months and did not survive ; there followed 
a single birth ; the fourth confinement produced twins ; the fifth and sixth were 
two single births; then there was a miscarriage and eighteen months later a 
quadruplet consisting of three males and a female of whom one male was still-born. 
Ten of this woman’s fourteen children were living. At the birth of the quadruplet 
four adherent placentas were removed, they were distinct and separate; four 
separate bags of membranes are described. 

Finally, it is perhaps worth while to include here the case of Dr de Leon, 
reported by Foy†. A woman aged 26 had given birth to fourteen children by two
husbands; by her first husband she had one pair of twins and four single births; 
by her second husband she had eight children in three years — two pairs of twins 
and a quadruplet ; all members of the quadruplet were living, the weight at birth 
of the children was 6 lbs., 5 lbs., 4½ lbs., and 4 lbs. respectively.

This short account of a few selected cases and references is extremely limited 
in its scope; the purpose of it is primarily to suggest through a few illustrative 


* Bibl. No. 15. † Bibl. No. 12.





examples the varied problems which arise in any consideration of the inheritance 
of multiple births. The historic extreme cases, which we must assuredly receive 
in a sceptical and doubting spirit, cannot for the most part be disproved, but 
unfortunately under modern social conditions it is difficult to believe that the 
possibility of some of them could ever again be demonstrated, at least in European 
populations. It is indeed astonishing that in 1886 the report concerning Maddalena
Granata did not attract more interest and comment and obtain verification ;
if this be an accurate history, it certainly justifies some belief in many other reports 
which have been discarded as too improbable for consideration. In our own country 
it is almost equally surprising that the family history due to Grigg was allowed to 
be lost sight of and remain without any adequate description. How very incurious 
most of us appear to be with regard to remarkable occurrences which do not touch 
upon our own immediate needs and occupations. 

I would call attention to one further point of some importance and interest.
Dr R. A. Fisher, from his very valuable study of Triplet Children*, concludes
that “The triplet data indicate that the paternal influence is only exerted in the
production of diembryony.” Now my very small collection of a few exceptional 
cases cannot be used to prove anything at all, but Figs. 1 and 2 do suggest that 
Dr Fisher’s experience is not of universal application. In Fig. 1, the twin births 
of IV. 10—13 are presumably due to their father’s influence, each pair is of like
sex, but their mother had no hesitation in her statement that they were very 
unlike in other respects. Again, in Fig. 2, due to Peiper, the long sequence of 
twins of unlike sex must be attributed to their father’s influence. 

Should the production of multiple births be regarded as a mark of exceptional 
vitality, or is it a sign of degeneracy with lack of control, or can we deduce nothing 
from the evidence available ? There appears to be a good deal of evidence sup- 
porting the view that on the whole the older mothers tend to produce multiple 
births, but we are not justified in accepting this as a sign of weakness or decadence
on her part; so far as I know, it is not necessarily the mothers who are worn out 
by repeated child-births who have twins, and we need to be very cautious in 
drawing any general conclusions from the mere fact of the age of the mother at 
the birth of her offspring. 

For obvious reasons the infant death rate is greatly augmented at multiple 
births, and thus it would appear to be a wasteful form of reproduction. As long 
ago as 1820, Merriman† gave figures from the Dublin Lying-in-Hospital referring
to the years 1787—1793, showing that of 368 twin children born to 184 mothers,
17·1 % died in hospital or were still-born, whereas 10,199 uniparous women only
lost 7 % of their children. Mitchell gives worse figures for the same hospital,
pointing again to the excessive mortality of twin babies in 1862. Strassmann
quotes Prussian statistics since 1871‡ indicating that the numbers of still-born
children (a) at single births, (b) among twins, and (c) among triplets are as
3·3 : 5·8 : 12·1. Mitchell§ formed the impression, and gives some figures to support

* Bibl. No. 18. † Bibl. No. 10. ‡ Bibl. No. 10. § Bibl. No. 11.




it, that twins and triplets occurred with rather a marked frequency amongst idiots 
or in the families of idiots, but I have no knowledge that his suggestion has been 
confirmed by an adequate investigation. 

Dr Fisher found no evidence suggesting that at a fixed age the surviving 
members of triplets were undersized as compared with children of single births. 
It is of interest to hear from Mr Herbert Spencer of a woman patient of his over 
six feet in height, who was one of a quadruplet; her three brothers were also 
over six feet in height and were serving in the same regiment in India. 


BIBLIOGRAPHY OF WORKS CITED.

1. Paré, Ambroise: The Works of. Translated by T. Johnson. London, 1649. [References on pp. 654, 655.]

2. Cromer, Martin: De origine et rebus gestis Polonorum libri XXX. Coloniae Agrippinae, 1589. [References on pp. 164, 197.]

3. Cock, Henrique: Relacion del viaje hecho por Felipe II, en 1585, á Zaragoza, Barcelona y Valencia, escrita por Henrique Cock. Madrid, 1876. [References on pp. 248, 249.]

4. Hakewill, G.: An Apologie or Declaration of the Power and Providence of God in the Government of the World. London, 1635. [Reference on p. 253.]

5. Wanley, N.: The Wonders of the Little World: or a General History of Man. London, 1678. [Revised and enlarged, 1806. Reference on p. 81.]

6. Seignette: Accouchement surprenant. Le Journal des Sçavans, T. for 1684, p. 92. Paris, 1684.

7. Garthshore, M.: A remarkable Case of Numerous Births, with Observations. Phil. Trans. Vol. lxxvii. pp. 344—358. London, 1787.

8. Hermann, B. F. J.: Statistische Schilderung von Russland. St Petersburg, 1790.

9. Pettigrew, T. J.: Memoirs of the Life and Writings of the late John Coakley Lettsom, M.D., F.R.S. London, 1817. [Reference in Vol. i. on p. 5.]

10. Merriman, S.: A Synopsis of the various kinds of Difficult Parturition. London, 1820.

11. Mitchell, A.: Plural Births in Connexion with Idiocy. Dublin Medical Press, Vol. ii. for 1862, pp. 526—527. Dublin, 1862.

12. Foy, G.: Plural Births. Medical Press and Circular, Vol. ii. for 1890, pp. 304—308. London, 1890.

13. Grigg, W. C.: Heredity as to Triplets. British Medical Journal, Vol. i. for 1890, p. 541. London, 1890.

14. Barfurth, D.: Ein Zeugnis für eine Geburt von Siebenlingen beim Menschen. Anatomischer Anzeiger, Bd. x. S. 330—332. Jena, 1895.

15. Roberts, L. W.: A Case of Quadruplets. British Medical Journal, Vol. ii. for 1905, p. 629. London, 1905.

16. Strassmann, P.: Die anthropologische Bedeutung der Mehrlinge. Zeitschrift f. Ethnologie, Bd. xl. S. 362—382. Berlin, 1908.

17. Peiper, A.: Zur Vererbung der Zwillingsschwangerschaft durch den Mann. Klinische Wochenschrift, Bd. ii. 1923, S. 1651. Berlin, 1923.

18. Fisher, R. A.: Triplet Children in Great Britain and Ireland. Proc. R. Soc. Vol. cii. B. pp. 286—311. London, 1928.



ON CORRELATION FUNCTIONS OF TYPE III. 

By S. D. WICKSELL, Lund (Sweden).


1. As originally shown by Helmert* the distribution of the second order 
moments in samples of N individuals, chosen at random from an infinitely large 
supply (parent population) will be of the form generally called Type III if the 
supply be normally distributed and the moments are taken around the true mean 
(mean of the supply). Helmert also has shown that the same form of error
distribution will be obtained if the sample moments are taken around the respective
sample means, a result which has later on been rediscovered by several writers.

It will be the main purpose of this paper to study the correlation surface 
obtained for the second order moments in samples of x and y, which are taken at 
random from a normally distributed bivariate supply (supposed to be infinitely 
large). As it is clear that the marginal distributions of this surface will both be of 
Type III, we have here an interesting object of investigation, i.e. a solid Type III 
distribution. 


The most convenient way to study problems of this kind seems to be by the 
aid of the so-called reciprocal or characteristic functions of the distributions. These 
functions are defined in the following way : 


Univariate case :

$$U(t) = \int dx\, f(x)\, e^{xt}. \tag{1}$$

Bivariate case :

$$U(t_1, t_2) = \int\!\!\int dx\, dy\, f(x, y)\, e^{xt_1 + yt_2}. \tag{2}$$


Here f denotes the frequency function of the distribution in question and U its 
characteristic function. The integrations are to be extended over the whole range 
of applicability of f. 

Evidently $U(t, 0)$ and $U(0, t)$ are the respective characteristic functions of the
marginal distributions of the correlation $f(x, y)$.


If the Napierian logarithm of $U$ is developed into powers of $t$ (or $t_1$ and $t_2$) the
coefficients of this expansion correspond to the so-called seminvariants $\lambda$ (of Thiele)
in the following way :


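Univariate case :

$$\log U(t) = \sum_{r} \lambda_r \frac{t^r}{r!}. \tag{3}$$

Bivariate case :

$$\log U(t_1, t_2) = \sum_{h}\sum_{k} \lambda_{hk} \frac{t_1^h\, t_2^k}{h!\, k!}. \tag{4}$$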


* See Biometrika, Vol. xxiii, 1931, p. 410. “Historical Note on the Distribution of the Standard
Deviations of Samples of any Size drawn from an Indefinitely large Normal Parent Population.” 
Editorial. 




If the characteristic function of a distribution has been found the theorem of 
Fourier gives the following solutions of the integral equations (1) and (2): 


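$$f(x) = \frac{1}{2\pi}\int_{-\infty}^{\infty} dw\, U(wi)\, e^{-xwi}, \tag{5}$$

and

$$f(x, y) = \frac{1}{(2\pi)^2}\int_{-\infty}^{\infty}\!\!\int_{-\infty}^{\infty} dw_1\, dw_2\, U(w_1 i, w_2 i)\, e^{-xw_1 i - yw_2 i}. \tag{6}$$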


We shall not here go into the questions of convergency of this inversion as it is
quite evident that no troubles of this kind will arise in the applications contained 
in this paper. 

Furthermore, as is easily seen, the characteristic function of the distribution of
a function of $x$, $\{z = g(x)\}$, is given by the equation

$$U(t) = \int dx\, f(x)\, e^{g(x)t}, \tag{7}$$

and the characteristic function of the bivariate distribution of two different
functions of $x$ and $y$, $\{z_1 = g(x, y),\ z_2 = h(x, y)\}$, is given by the equation

$$U(t_1, t_2) = \int\!\!\int dx\, dy\, f(x, y)\, e^{g(x, y)t_1 + h(x, y)t_2}. \tag{8}$$

Putting $t_2 = 0$ in this equation we obtain the characteristic function of the
distribution of a function $g(x, y)$ of a pair of correlated variables $x$ and $y$.

Finally, it is well known and easily demonstrated that the characteristic function 
of the distribution of the sum of a number of independent variables is equal to 
the product of the characteristic functions of the distributions of the respective 
variables taken alone. 

This we may state in the following form : If $U_i(t)$ is the characteristic function
of the distribution of $x_i$, then the characteristic function of the distribution of

$$z = x_1 + x_2 + x_3 + \dots + x_N$$

is

$$U(t) = U_1(t) \cdot U_2(t) \cdot U_3(t) \dots U_N(t), \tag{9}$$

if the variables $x_1, x_2, x_3, \dots x_N$ are distributed independently of each other.

Similarly : If $U_i(t_1, t_2)$ is the characteristic function of the correlation of any
pair $x_i$, $y_i$, then

$$U(t_1, t_2) = U_1(t_1, t_2) \cdot U_2(t_1, t_2) \cdot U_3(t_1, t_2) \dots U_N(t_1, t_2)$$

is the characteristic function of the correlation of $z_1 = x_1 + x_2 + \dots + x_N$ and
$z_2 = y_1 + y_2 + \dots + y_N$ if the different pairs $x_i$, $y_i$ are chosen independently of each
other.

2. I shall begin by demonstrating Helmert's proposition with the aid of the
characteristic function. 

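Assuming the frequency function of $x$ in the supply to be

$$f(x) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{x^2}{2\sigma^2}},$$

the characteristic function of the distribution of $\dfrac{x^2}{N}$ is, according to (7), equal to

$$\left(1 - \frac{2\sigma^2}{N}t\right)^{-\frac{1}{2}}. \tag{10}$$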




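Consequently the characteristic function of the distribution of the second order
moment about the “true” mean, $z = \frac{1}{N}\Sigma x^2$, in samples of $N$ will, according to (9), be

$$U(t) = \left(1 - \frac{2\sigma^2}{N}t\right)^{-\frac{N}{2}}. \tag{11}$$

On account of (5) we finally get the frequency function of $z$ from the equation

$$f_N(z) = \frac{1}{2\pi}\int_{-\infty}^{\infty} dw\, e^{-zwi}\left(1 - \frac{2\sigma^2}{N}wi\right)^{-\frac{N}{2}}. \tag{12}$$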

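On evaluation the integral (12) gives the Type III distribution spoken of.
This will most easily be seen by the transformation

$$\frac{2\sigma^2}{N}w = \tau, \qquad \frac{zN}{2\sigma^2} = \zeta,$$

which gives

$$f_N(z) = \frac{N}{2\sigma^2}\cdot\frac{1}{2\pi}\int_{-\infty}^{\infty} d\tau\, (1 - \tau i)^{-\frac{N}{2}}\, e^{-\zeta\tau i}, \tag{13}$$

or, putting $(1 - \tau i)\,\zeta = \xi$,

$$f_N(z) = \frac{N}{2\sigma^2}\, \zeta^{\frac{N}{2}-1} e^{-\zeta}\cdot\frac{1}{2\pi i}\int_{\zeta - i\infty}^{\zeta + i\infty} \xi^{-\frac{N}{2}}\, e^{\xi}\, d\xi, \tag{14}$$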

$\xi$ being a complex variable and the integration taking place along a line running
at the distance $+\zeta$, parallel to the imaginary axis. As the integrand vanishes at
the extreme ends of this line the value of the integral will not be affected by
a variation in $\zeta$. Hence, as the integral is clearly convergent, it is constant and
independent of $\zeta$, and it follows that we must have

$$f_N(z) = k\, \frac{N}{2\sigma^2}\, \zeta^{\frac{N}{2}-1} e^{-\zeta}. \tag{15}$$

An actual evaluation of the integral will show that we have $k = \dfrac{1}{\Gamma(\frac{1}{2}N)}$, which
also follows from the equation

$$\int_0^{\infty} f_N(z)\, dz = 1,$$

which must here necessarily be fulfilled.


Thus we find the Type III distribution for $z$,

$$f_N(z) = \frac{1}{\Gamma(\frac{1}{2}N)}\left(\frac{N}{2\sigma^2}\right)^{\frac{N}{2}} z^{\frac{N}{2}-1}\, e^{-\frac{Nz}{2\sigma^2}}. \tag{16}$$

Q.E.D.
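The result (16) lends itself to a quick numerical check. The following is only a sketch, not part of the original argument: it assumes (16) in the form of a Type III curve with exponent ½N − 1 and rate N/(2σ²), and the names `type3_density` and `midpoint_integral`, as well as the trial values N = 10 and σ² = 2, are chosen purely for illustration. It confirms that the curve has unit area and that the mean of z is σ².

```python
import math


def type3_density(z, n, sigma2):
    """Density (16) of z = (1/N) * sum(x_i**2), the x_i normal with variance sigma2."""
    if z <= 0.0:
        return 0.0
    rate = n / (2.0 * sigma2)   # N / (2 sigma^2)
    half_n = 0.5 * n            # N / 2
    # Work with logarithms so that large N does not overflow the gamma function.
    log_f = (half_n * math.log(rate) + (half_n - 1.0) * math.log(z)
             - rate * z - math.lgamma(half_n))
    return math.exp(log_f)


def midpoint_integral(func, lower, upper, steps=100_000):
    """Plain midpoint rule on a finite range."""
    width = (upper - lower) / steps
    return width * sum(func(lower + (i + 0.5) * width) for i in range(steps))


N, SIGMA2 = 10, 2.0  # illustrative values only
# The upper limit 40 is far in the tail for these values.
total = midpoint_integral(lambda z: type3_density(z, N, SIGMA2), 0.0, 40.0)
mean = midpoint_integral(lambda z: z * type3_density(z, N, SIGMA2), 0.0, 40.0)
```

With these values `total` should come out very close to 1 and `mean` very close to σ² = 2, in agreement with the first seminvariant of the curve.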





By taking the logarithm of (11) and expanding in powers of $t$ we get the
following well-known formula for the seminvariants of Type III as given in (16),

$$\lambda_r = (r-1)!\left(\frac{2\sigma^2}{N}\right)^{r-1}\sigma^2. \tag{17}$$
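The expansion behind (17) may be written out as follows. This is a sketch of the intermediate step only, using the logarithmic series and the definition (3); nothing beyond (3) and (11) is assumed.

```latex
\begin{align*}
\log U(t) &= -\frac{N}{2}\log\left(1 - \frac{2\sigma^2}{N}t\right)
           = \frac{N}{2}\sum_{r=1}^{\infty}\frac{1}{r}\left(\frac{2\sigma^2}{N}\right)^{r} t^{r} \\
          &= \sum_{r=1}^{\infty}(r-1)!\left(\frac{2\sigma^2}{N}\right)^{r-1}\sigma^2\,\frac{t^{r}}{r!},
\end{align*}
```

so that comparison with (3) gives (17); in particular the mean is $\lambda_1 = \sigma^2$ and the variance is $\lambda_2 = 2\sigma^4/N$.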

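3. A Generalised Form of Type III will be at hand in the error function of the
second order moment, taken around a fixed point at a given distance $a$ from the
mean. To derive this function we have, in analogy to (10), the characteristic
function of the distribution of $\dfrac{(x - a)^2}{N}$ equal to

$$\left(1 - \frac{2\sigma^2}{N}t\right)^{-\frac{1}{2}} e^{\frac{a^2 t / N}{1 - \frac{2\sigma^2}{N}t}}, \tag{10*}$$

which gives for the frequency function of $z = \frac{1}{N}\Sigma(x - a)^2$ in samples of $N$ the
following integral form (cp. (12)):

$$F_N(z) = \frac{1}{2\pi}\int_{-\infty}^{\infty} dw\, e^{-zwi}\left(1 - \frac{2\sigma^2}{N}wi\right)^{-\frac{N}{2}} e^{\frac{a^2 wi}{1 - \frac{2\sigma^2}{N}wi}}. \tag{12*}$$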

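This function, which passes over into the function (12) and the ordinary Type III
curve when $a \to 0$, generally* cannot be expressed in elementary functions, except
as an infinite series. If, for instance, the factor

$$e^{\frac{a^2 wi}{1 - \frac{2\sigma^2}{N}wi}}$$

is developed in powers of the exponent, and it is noticed that according to (12)
we have, when $n < \dfrac{N}{2}$,

$$\frac{1}{2\pi}\int_{-\infty}^{\infty} dw\, (-wi)^n\left(1 - \frac{2\sigma^2}{N}wi\right)^{-\frac{N}{2}} e^{-zwi} = \frac{d^n}{dz^n} f_N(z), \tag{13*}$$

we find the following serial expression :

$$F_N(z) = f_N(z) - a^2 f'_{N+2}(z) + \frac{a^4}{2!} f''_{N+4}(z) - \frac{a^6}{3!} f'''_{N+6}(z) + \dots, \tag{16**}$$

where $f_{N+2n}$ is formed from (12) by writing $N + 2n$ for $N$ in the exponent, $\dfrac{2\sigma^2}{N}$ being left unchanged,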

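which series may readily be reduced to a form fit for numerical applications. As
a matter of fact we may write

$$f^{(n)}_{N+2n}(z) = (-1)^n f_N(z) \cdot P_n(z),$$

where $P_n(z)$ is a polynomial of the $n$th degree in $z$. Its general form is, if we put
$\zeta = \dfrac{zN}{2\sigma^2}$,

$$P_n(z) = \left(\frac{N}{2\sigma^2}\right)^{n}\Gamma(\tfrac{1}{2}N)\left[\frac{\zeta^{n}}{\Gamma(\frac{1}{2}N + n)} - \frac{n}{1!}\,\frac{\zeta^{n-1}}{\Gamma(\frac{1}{2}N + n - 1)} + \frac{n(n-1)}{2!}\,\frac{\zeta^{n-2}}{\Gamma(\frac{1}{2}N + n - 2)} - \dots\right].$$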




* When $N = 1$ (12*) must be integrable as it is easily shown that we must have

$$F_1(z) = \frac{1}{2\sqrt{z}}\left\{f(a + \sqrt{z}) + f(a - \sqrt{z})\right\}.$$





It will be seen that (16**) is a special case of the development of a generalized 
Pearson Type III, given by Romanowsky (Biometrika, xvi. pp. 114—116). The
objections advanced by Professor Pearson against Romanowsky’s generalisations 
do not apply to the special form here arrived at, as (16**) is, by the nature of the 
problem of which it gives the solution, an analytically well-defined probability 
function with a limited number of arbitrary parameters (one more than in the 
ordinary Type III)*.

Taking the logarithm of (10*), multiplying by $N$ and expanding in powers of $t$,
we get, on comparing with formula (3), the following simple general formula for
the seminvariants of the generalised Type III, as defined by formula (12*),

$$\lambda_r = (r-1)!\left(\frac{2\sigma^2}{N}\right)^{r-1}(\sigma^2 + ra^2). \tag{17*}$$


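Forming the standardised seminvariants

$$\gamma_r = \frac{\lambda_r}{\lambda_2^{r/2}} = (r-1)!\left(\frac{2\sigma^2}{N}\right)^{\frac{r}{2}-1}\frac{\sigma^2 + ra^2}{(\sigma^2 + 2a^2)^{r/2}},$$

we see that our generalised Type III will pass over into the normal frequency
function (of which $\gamma_r = 0$, except for $r = 2$) not only when $N$ grows, but also with
a growing value of $a^2$.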
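The behaviour of the standardised seminvariants is easily tabulated. The sketch below assumes (17*) in the form λ_r = (r − 1)! (2σ²/N)^{r−1} (σ² + r a²); the function names and the trial values are illustrative only and do not come from the paper. It shows the third standardised seminvariant (the skewness) falling off as a² grows while N is held fixed.

```python
import math


def generalised_seminvariant(r, n, sigma2, a2):
    """lambda_r of (17*): (r-1)! * (2 sigma^2 / N)**(r-1) * (sigma^2 + r a^2)."""
    return math.factorial(r - 1) * (2.0 * sigma2 / n) ** (r - 1) * (sigma2 + r * a2)


def standardised_seminvariant(r, n, sigma2, a2):
    """gamma_r = lambda_r / lambda_2**(r/2)."""
    lam2 = generalised_seminvariant(2, n, sigma2, a2)
    return generalised_seminvariant(r, n, sigma2, a2) / lam2 ** (r / 2.0)


# Illustrative values: N = 10, sigma^2 = 1, with a^2 = 0 and a^2 = 100.
skew_small_a = standardised_seminvariant(3, 10, 1.0, 0.0)
skew_large_a = standardised_seminvariant(3, 10, 1.0, 100.0)
```

For a² = 0 the skewness is that of the ordinary Type III, √(8/N); for the large value of a² it is much smaller, in line with the passage to the normal curve described above.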


As a frequency function to be fitted to a given set of data the generalised
Type III has four arbitrary constants, i.e. in the notation here used, the parameters
$a^2$, $\dfrac{2\sigma^2}{N}$ and $N$, and the position, on the scale in which the variate is measured, of
the origin (starting point) of the curve. These constants may be determined from
the mean and higher moments in the following way : We first have

$$\lambda_2 = \nu_2; \qquad \lambda_3 = \nu_3; \qquad \lambda_4 = \nu_4 - 3\nu_2^2,$$

if $\nu_k$ denotes the central moment of the $k$th order. We further put

$$s = \sqrt{9\lambda_3^2 - 6\lambda_2\lambda_4}.$$


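This quantity must be real as a first condition for applicability. If $s = 0$, we have
the ordinary Type III curve†. We now get

$$a^2 = \frac{s\,(3\lambda_3 + s)^2}{6\lambda_4^2}; \qquad \frac{2\sigma^2}{N} = \frac{\lambda_4}{3\lambda_3 + s}; \qquad \tfrac{1}{2}N = \frac{(\lambda_3 - s)(3\lambda_3 + s)^3}{2\lambda_4^3}.$$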

If $Z_0$ is the starting point of the curve, we finally have, $m$ denoting the mean
of $z$,

$$Z_0 = m - \sigma^2 - a^2 = m - \frac{(3\lambda_3 - 2s)(3\lambda_3 + s)^2}{6\lambda_4^2}.$$


* As pointed out by Dr E. S. Pearson the function $F_N(z)$ must also be identical with a function
studied by R. A. Fisher in connection with his investigations of the “General Sampling Distribution
of the Multiple Correlation Coefficient” (Proc. R. Soc. A. V. 121, 1928). Fisher uses another form of
development, which is obtained if the series (16**) is rearranged according to powers of $\zeta$.

† In Pearson's well-known notation we have

$$s = \lambda_2^{3/2}\sqrt{3\,(6 + 3\beta_1 - 2\beta_2)}.$$




It is seen that we must have on the one side (in order that the coefficients
should be real)

$3\lambda_3^2 > 2\lambda_2\lambda_4$,

and on the other side (in order that $\tfrac12 N > 0$)

$\lambda_3 > s$,

which gives $4\lambda_3^2 < 3\lambda_2\lambda_4$.

It follows that $\lambda_4$ must be positive (positive excess). The seminvariant $\lambda_3$ can
always be considered as positive, as this only requires that the positive direction
on the z-axis is appropriately chosen.

Thus the above inequalities require that we must have

$8\lambda_2\lambda_4 < 12\lambda_3^2 < 9\lambda_2\lambda_4$,

or, in Pearson's notation, $8\beta_2 - 24 < 12\beta_1 < 9\beta_2 - 27$.

In the $\beta_1$, $\beta_2$ diagram this inequality defines the area between two straight
lines, intersecting in the point $\beta_1 = 0$, $\beta_2 = 3$, one of the lines being the line on
which the ordinary Type III is strictly valid. The angle between the lines is
rather narrow, but one has the impression that it must embrace an important part
of the combinations of $\beta_1$ and $\beta_2$ occurring in practice.
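The admissible region can be tested for any pair of moment ratios with a one-line check. The sketch below simply encodes the double inequality $8\beta_2 - 24 < 12\beta_1 < 9\beta_2 - 27$ as read from the text; the function name is hypothetical.

```python
def in_generalised_type_iii_region(beta1, beta2):
    """True when (beta1, beta2) lies strictly between the two bounding lines."""
    return 8.0 * beta2 - 24.0 < 12.0 * beta1 < 9.0 * beta2 - 27.0


# Moment ratios of a curve with sigma^2 = 2, a^2 = 0.5, N = 10 (illustrative values):
# lambda_2 = 1.2, lambda_3 = 1.12, lambda_4 = 1.536.
beta1_example = 1.12 ** 2 / 1.2 ** 3
beta2_example = 3.0 + 1.536 / 1.2 ** 2
inside = in_generalised_type_iii_region(beta1_example, beta2_example)
```

The normal point (0, 3) and every point on the ordinary Type III line $2\beta_2 = 3\beta_1 + 6$ fall on the boundary and are therefore reported as outside the strict region.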


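4. We shall now treat the following:

Problem 1. From an infinitely large normal bivariate supply samples of N
pairs of x and y are taken at random. What will be the bivariate distribution of
$z_1 = \dfrac{1}{N}\Sigma x^2$, and $z_2 = \dfrac{1}{N}\Sigma y^2$, if x and y are reckoned from the true means?

Putting

$f(x, y) = \dfrac{1}{2\pi\sigma_1\sigma_2\sqrt{1-r^2}}\exp\left\{-\dfrac{1}{2(1-r^2)}\left(\dfrac{x^2}{\sigma_1^2} - \dfrac{2rxy}{\sigma_1\sigma_2} + \dfrac{y^2}{\sigma_2^2}\right)\right\}$ (18),

we find from (8) that the characteristic function of the bivariate distribution of
$x^2$ and $y^2$ is

$\left[(1 - 2\sigma_1^2 t_1 i)(1 - 2\sigma_2^2 t_2 i) + 4\sigma_1^2\sigma_2^2 r^2 t_1 t_2\right]^{-1/2}$ (19).

Consequently the characteristic function of the bivariate distribution of $z_1$ and
$z_2$ will be

$U(t_1, t_2) = \left[\left(1 - \dfrac{2\sigma_1^2}{N}t_1 i\right)\left(1 - \dfrac{2\sigma_2^2}{N}t_2 i\right) + \dfrac{4\sigma_1^2\sigma_2^2}{N^2}r^2 t_1 t_2\right]^{-N/2}$ (20),

and the correlation function of $z_1$ and $z_2$ will be given by the double integral

$F(z_1, z_2) = \dfrac{1}{(2\pi)^2}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} dt_1\,dt_2\left[\left(1 - \dfrac{2\sigma_1^2}{N}t_1 i\right)\left(1 - \dfrac{2\sigma_2^2}{N}t_2 i\right) + \dfrac{4\sigma_1^2\sigma_2^2}{N^2}r^2 t_1 t_2\right]^{-N/2} e^{-z_1 t_1 i - z_2 t_2 i}$ ...(21).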





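Introducing the transformation

$\xi_1 = \dfrac{z_1 N}{2\sigma_1^2}$, $\xi_2 = \dfrac{z_2 N}{2\sigma_2^2}$,

and putting

$\dfrac{2\sigma_1^2}{N}t_1 = \tau_1$; $\dfrac{2\sigma_2^2}{N}t_2 = \tau_2$,

we find for the correlation function of $\xi_1$ and $\xi_2$ the somewhat simpler formula

$f(\xi_1, \xi_2) = \dfrac{1}{(2\pi)^2}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} d\tau_1\,d\tau_2\left[(1 - \tau_1 i)(1 - \tau_2 i) + r^2\tau_1\tau_2\right]^{-N/2} e^{-\xi_1\tau_1 i - \xi_2\tau_2 i}$ ...(22).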

It is immediately seen that if r = 0 this formula reduces to the product of two
integrals of the form (13), and $F(z_1, z_2)$ will then be the product of two Type III
functions of the form (16). It is also easily seen that for any value of r the
marginal distributions,

$f_1(\xi_1) = \int_0^\infty f(\xi_1, \xi_2)\,d\xi_2$; $f_2(\xi_2) = \int_0^\infty f(\xi_1, \xi_2)\,d\xi_1$ (23),

take the form (13). As a matter of fact the characteristic functions of the marginal
distributions are $U(t_1, 0)$ and $U(0, t_2)$, respectively. The correlation function (21)
or (22) is thus of Type III in both its marginals*.

Now, as our function has only one more arbitrary constant than the Bravais 
function (18), it must be a rather special form of correlation function. As a matter 
of fact it will be found on inspection that both the marginals of (21) have the same 
skewness or, which is the same, that the marginals of the function (22) are identical 
in form when reduced to scales in which the standard deviations are equal. Further- 
more, the regression may be shown to be strictly linear, which, in a way, is a serious 
restriction to the applicability of a skew correlation function. But, on the other 
side, the coefficient of correlation is free and not a function of the marginal constants†.

The linearity of regression will most conveniently be shown by finding the 
seminvariants of (22). We evidently have 

$-\tfrac12 N\log\left[(1 - t_1)(1 - t_2) - r^2 t_1 t_2\right] = \sum_k\sum_l \lambda_{kl}\dfrac{t_1^k t_2^l}{k!\,l!}$ (24).

Hence we find (besides Xoo = 0), 

(25), 

which gives

$\lambda_{10} = \lambda_{01} = \tfrac12 N$; $\lambda_{20} = \lambda_{02} = \tfrac12 N$;
$\lambda_{30} = \lambda_{03} = N$; $\lambda_{40} = \lambda_{04} = 3N$ (26).


* Of course, the same would have been the case if we had taken the correlation surface of the
second order moments about the sample means instead of, as here has been done, around the means of
the parent population. In Biometrika, Vol. xvii. 1928, Professor Pearson has studied the error surface
of the sample standard deviations $\sigma_1$ and $\sigma_2$. Deriving from this the surface of $z_1 = \sigma_1^2$ and $z_2 = \sigma_2^2$
a formula very similar to (21) and (22) will be obtained.

† Cp. Karl Pearson: “Notes on Skew Frequency Surfaces,” Biometrika, Vol. xv. pp. 222 f.





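Further, we easily derive the general formula

$\lambda_{k1} = \lambda_{1k} = \tfrac12 N\,k!\,r^2$ ...(27),

which gives $\lambda_{11} = \tfrac12 N r^2$,

$\lambda_{21} = \lambda_{12} = N r^2$,

$\lambda_{31} = \lambda_{13} = 3N r^2$.

For $\lambda_{22}$ we find the particular value

$\lambda_{22} = N(2r^2 + r^4)$.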
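These values can be checked mechanically by expanding the logarithm in (24) as a double power series. The sketch below is an illustration only: it assumes the generating function $-\tfrac12 N\log[(1-t_1)(1-t_2) - r^2 t_1 t_2]$, writes the bracket as $1 - A$ with $A = t_1 + t_2 - (1-r^2)t_1 t_2$, and sums $A^m/m$ with exact fractions. The function name and the truncation scheme are choices made here, not the author's.

```python
from fractions import Fraction
from math import factorial


def seminvariants(n, r2, order=4):
    """Exact lambda_{kl} with k + l <= order for the assumed form of (24)."""
    # A(t1, t2) = t1 + t2 - (1 - r^2) t1 t2, stored as {(power of t1, power of t2): coeff}.
    a_poly = {(1, 0): Fraction(1), (0, 1): Fraction(1), (1, 1): -(1 - Fraction(r2))}

    def multiply(p, q):
        out = {}
        for (i, j), x in p.items():
            for (k, l), y in q.items():
                if i + j + k + l <= order:      # drop terms beyond the wanted order
                    key = (i + k, j + l)
                    out[key] = out.get(key, Fraction(0)) + x * y
        return out

    series = {}                                  # -log(1 - A) = sum over m of A^m / m
    power = {(0, 0): Fraction(1)}
    for m in range(1, order + 1):
        power = multiply(power, a_poly)
        for key, x in power.items():
            series[key] = series.get(key, Fraction(0)) + x / m

    half_n = Fraction(n) / 2
    return {key: half_n * factorial(key[0]) * factorial(key[1]) * c
            for key, c in series.items()}


lam = seminvariants(10, Fraction(1, 4))
```

For N = 10 and $r^2 = 1/4$ the dictionary `lam` holds, for instance, `lam[(2, 2)]`, which should equal $N(2r^2 + r^4) = 45/8$, and the criterion for linear regression can be confirmed term by term from the same table.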

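As the general criterion for linear regression is*

$\lambda_{k1}\lambda_{20} = \lambda_{k+1,0}\lambda_{11}$, $\lambda_{1k}\lambda_{02} = \lambda_{0,k+1}\lambda_{11}$ (28),

respectively, for the two regressions $\xi_2$ on $\xi_1$, and $\xi_1$ on $\xi_2$, it is seen that both the
regressions of (22) are linear. The coefficient of correlation of $\xi_1$ and $\xi_2$ (as well as
of $z_1$ and $z_2$) is

$\rho = \dfrac{\lambda_{11}}{\sqrt{\lambda_{20}\lambda_{02}}}$ (29),

whence we see that we have $\rho = r^2$ (30).

The regression lines in the plane of $\xi_1$, $\xi_2$ are consequently given by the equations

$\xi_2 = \tfrac12 N + r^2(\xi_1 - \tfrac12 N)$,
$\xi_1 = \tfrac12 N + r^2(\xi_2 - \tfrac12 N)$ (31).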

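In the distribution of $z_1$ and $z_2$ the seminvariants are clearly

$\lambda_{k0} = \tfrac12 N\,(k-1)!\left(\dfrac{2\sigma_1^2}{N}\right)^k$ (32),

$\lambda_{k1} = \tfrac12 N\,k!\,r^2\left(\dfrac{2\sigma_1^2}{N}\right)^k\dfrac{2\sigma_2^2}{N}$ (33).

Hence we have

$\lambda_{10} = \sigma_1^2$; $\lambda_{01} = \sigma_2^2$;

$\lambda_{20} = \dfrac{2\sigma_1^4}{N}$; $\lambda_{11} = \dfrac{2\sigma_1^2\sigma_2^2 r^2}{N}$; $\lambda_{02} = \dfrac{2\sigma_2^4}{N}$;

$\lambda_{30} = \dfrac{8\sigma_1^6}{N^2}$; $\lambda_{21} = \dfrac{8\sigma_1^4\sigma_2^2}{N^2}r^2$; $\lambda_{12} = \dfrac{8\sigma_1^2\sigma_2^4}{N^2}r^2$; $\lambda_{03} = \dfrac{8\sigma_2^6}{N^2}$;

$\lambda_{40} = \dfrac{48\sigma_1^8}{N^3}$; $\lambda_{31} = \dfrac{48\sigma_1^6\sigma_2^2}{N^3}r^2$; $\lambda_{22} = \dfrac{16\sigma_1^4\sigma_2^4(2r^2 + r^4)}{N^3}$; $\lambda_{13} = \dfrac{48\sigma_1^2\sigma_2^6}{N^3}r^2$; $\lambda_{04} = \dfrac{48\sigma_2^8}{N^3}$.

The regression equations in the $z_1$, $z_2$ plane are thus

$z_2 = \sigma_2^2 + r^2\dfrac{\sigma_2^2}{\sigma_1^2}(z_1 - \sigma_1^2)$,
$z_1 = \sigma_1^2 + r^2\dfrac{\sigma_1^2}{\sigma_2^2}(z_2 - \sigma_2^2)$ (34).

* This formula may be generally known only when the $\lambda_{kl}$ denote central moments. It may, however,
be shown that it is valid also in the case of seminvariants.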





The regression of the means may also be deduced in another way from the
characteristic function, and this method has the merit of being applicable also in
finding the regressions of the higher order moments in the arrays (the scedasticity,
the clisy, etc.). Writing any correlation function in the form

$f(x, y) = f(x)\,p_x(y)$ (35),

$p_x(y)$ is the relative frequency function of y in an x-array. The moments about
a fixed point in the x-arrays we denote by

$\nu_n'(x) = \int dy\,y^n p_x(y)$ (36).

If now the above form of f(x, y) is inserted in (2) we get

$U(t_1, t_2) = \int dx\,e^{x t_1 i} f(x)\int dy\,e^{y t_2 i} p_x(y)$ (37),

whence we find that

$\left[\dfrac{\partial^n U}{\partial (t_2 i)^n}\right]_{t_2 = 0} = \int dx\,e^{x t_1 i} f(x)\,\nu_n'(x)$ (38).

Using the Fourier theorem we thus get the important formula*

$f(x)\,\nu_n'(x) = \dfrac{1}{2\pi}\int_{-\infty}^{\infty} dt_1\,e^{-x t_1 i}\left[\dfrac{\partial^n U}{\partial (t_2 i)^n}\right]_{t_2 = 0}$ (39).

Now we have, in the case here in question, 

- ‘‘ »<*+*><*+«>•■•<»+ s<»- D)...(4°). 

L. 0T a -k-» 2'*(1 -ti) 4 

Thus we get, remembering (13*),

f » (&>«/ (6) « X(N+ 2 ) (N + 4) ... (N + 2 (« - 1)) 1 [/„ (&) - (J) rYWft) 

+ g) *(6) + (- i)*i*/ ( "W6>] -(41). 

But it will be easily verified that we always have, as already remarked in § 3, 

(— PaiZi) (42), 

where $P_n(\xi_1)$ is a polynomial of the nth degree in $\xi_1$. Hence we find that we have

V(ri) = ^(W+2)(W + 4)...(ilT+2(n-l)) ! ( (”)r*«P,(ri)-(43), 

and it is seen that the array moment of the nth order, when taken around a fixed 
point (the total mean for instance), is a whole rational function (of the nth degree) 
of the independent variable. 

Carrying out the development for n = 1 and n = 2 we now get for the regression
of the mean

$\nu_1'(\xi_1) = \tfrac12 N + r^2(\xi_1 - \tfrac12 N)$,


* By the aid of this formula many regression problems can be readily solved, even when the correlation function is not explicitly given. In a forthcoming paper I have used the formula in developing
regression formulae, depending on the marginal frequency function f(x) and the seminvariants $\lambda_{kl}$
only, thus allowing us to take full advantage of the theory of univariate frequency functions, in
particular that of Pearson.



in accordance with (31), and for the regression of the variance (scedasticity)

$\nu_2(\xi_1) = \nu_2'(\xi_1) - [\nu_1'(\xi_1)]^2 = (1 - r^2)\left[\tfrac12 N(1 - r^2) + 2r^2\xi_1\right]$.

It is consequently seen that the central moment of the second degree (the variance)
in an array is a linear function of the independent variate. The scedasticity as
measured by the variance is thus linear as well as the regression of the mean.
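A simple consistency check on the two linear regressions is that the mean of the array variances plus the variance of the array means must reproduce the total variance $\lambda_{02} = \tfrac12 N$ of $\xi_2$. The sketch below assumes the array mean $\tfrac12 N + r^2(\xi_1 - \tfrac12 N)$ and the array variance $(1 - r^2)[\tfrac12 N(1 - r^2) + 2r^2\xi_1]$; the latter is the form obtained here from the array being a displaced sum of squares, and should be read as an assumption wherever the printed formula is incomplete. Function names are hypothetical.

```python
from fractions import Fraction


def array_mean(xi1, n, r2):
    """Assumed regression of the mean of xi_2 on xi_1, as in (31)."""
    return Fraction(n, 2) + r2 * (xi1 - Fraction(n, 2))


def array_variance(xi1, n, r2):
    """Assumed linear scedastic function of xi_2 in a xi_1-array."""
    return (1 - r2) * (Fraction(n, 2) * (1 - r2) + 2 * r2 * xi1)


def total_variance(n, r2):
    """Mean array variance plus variance of array means (marginal mean and variance are N/2)."""
    mean_xi1 = Fraction(n, 2)
    var_xi1 = Fraction(n, 2)
    # The array variance is linear in xi_1, so its mean is its value at the mean of xi_1.
    mean_of_array_variance = array_variance(mean_xi1, n, r2)
    variance_of_array_mean = r2 ** 2 * var_xi1
    return mean_of_array_variance + variance_of_array_mean


check = total_variance(10, Fraction(1, 4))
```

The identity holds for every $r^2$, since $(1 - r^2)\,\tfrac12 N(1 + r^2) + r^4\,\tfrac12 N = \tfrac12 N$.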

Finally, it may be pointed out that although the marginals of our correlation 
function are both of Type III this is, except when r = 0, not the case with the 
arrays. It will easily be seen that the distribution of an array must be equal to 
the sample distribution of the second order moment taken around a point which 
does not coincide with the mean (except when r = 0). Thus this distribution is of 
the generalised Type III given in § 3 (Eqs. (12*) and (16**)). We may conclude 
that (21) and (22) generally cannot, except as an infinite series, be expressed by 
elementary functions. This follows from the fact that a correlation function is 
always expressible as the product of the marginal and the array distribution. 

In order to solve the integrals (21) or (22), thus to be able to compute $F(z_1, z_2)$,
we shall therefore have to use expansions of one sort or another. Such expansions
have at my request been developed by Mr Tage Larsson. One such expansion is
obtained by putting $z = z_1$, $\sigma^2 = \sigma_1^2$ in (16) and $z = z_2$, $\sigma^2 = \sigma_2^2(1 - r^2)$, $a^2 = \dfrac{\sigma_2^2}{\sigma_1^2}r^2 z_1$
in (16**), and multiplying the two expressions.

From this development it follows, on taking account of (17*), that not only the
mean and variance but also the seminvariants of higher order of $z_2$ in a $z_1$-array
are linear functions of $z_1$.

Our correlation function thus has the property that all the array-seminvariants
have linear regression.

5. The problem treated in the preceding section led to correlation functions of
Type III, but of a rather special kind, the number of arbitrary parameters being
only one more than for normal correlation, the skewness being the same in both
marginals.

In order to increase the number of parameters, in particular to get a surface 
with different degrees of skewness in the two marginals, we shall give the problem 
in the following somewhat generalized form. 


Problem 2. From a normally correlated supply samples of $n + n_1$ x's and $n + n_2$
y's are taken. The n first sample values are taken from individuals for which both
x and y are given (n pairs of x and y are first taken). The remaining $n_1$ x's are
taken from individuals for which y is not observed and the $n_2$ remaining y's from
individuals for which x is not observed. Find the bivariate distribution of the
sample moments

$z_1 = \dfrac{\Sigma x^2}{n + n_1}$, and $z_2 = \dfrac{\Sigma y^2}{n + n_2}$,

x and y being reckoned from the mean of the supply.





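In this case, as may easily be verified, the characteristic function takes the form

$U(t_1, t_2) = \left[\left(1 - \dfrac{2\sigma_1^2 t_1 i}{n + n_1}\right)\left(1 - \dfrac{2\sigma_2^2 t_2 i}{n + n_2}\right) - r^2\dfrac{(2\sigma_1^2 t_1 i)(2\sigma_2^2 t_2 i)}{(n + n_1)(n + n_2)}\right]^{-n/2}\left[1 - \dfrac{2\sigma_1^2 t_1 i}{n + n_1}\right]^{-n_1/2}\left[1 - \dfrac{2\sigma_2^2 t_2 i}{n + n_2}\right]^{-n_2/2}$.

Introducing the variables

$\xi_1 = \dfrac{n + n_1}{2\sigma_1^2}z_1$, $\xi_2 = \dfrac{n + n_2}{2\sigma_2^2}z_2$,

$\tau_1 = \dfrac{2\sigma_1^2 t_1 i}{n + n_1}$, $\tau_2 = \dfrac{2\sigma_2^2 t_2 i}{n + n_2}$,

we have for the characteristic function of the bivariate distribution of $\xi_1$ and $\xi_2$,

$U(t_1, t_2) = \left[(1 - \tau_1)(1 - \tau_2) - r^2\tau_1\tau_2\right]^{-n/2}\left[1 - \tau_1\right]^{-n_1/2}\left[1 - \tau_2\right]^{-n_2/2}$ ...(45).

Thus we have

$f(\xi_1, \xi_2) = \dfrac{1}{(2\pi)^2}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} dw_1\,dw_2\left[(1 - w_1 i)(1 - w_2 i) + r^2 w_1 w_2\right]^{-n/2}\left[1 - w_1 i\right]^{-n_1/2}\left[1 - w_2 i\right]^{-n_2/2} e^{-\xi_1 w_1 i - \xi_2 w_2 i}$ ...(46).

In this function there are three more arbitrary parameters than in the Bravais
function and two more arbitrary parameters than in (22).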

The seminvariants are here evidently, as far as the marginal seminvariants are
concerned, given by the formulae

$\lambda_{k0} = \dfrac{n + n_1}{2}(k-1)!$; $\lambda_{0k} = \dfrac{n + n_2}{2}(k-1)!$ (47).

For the mixed seminvariants we evidently have the same formulae as in § 3, i.e.

$\lambda_{k1} = \lambda_{1k} = \tfrac12 n\,k!\,r^2$.

The criteria for linear regression,

$\lambda_{k1}\lambda_{20} = \lambda_{k+1,0}\lambda_{11}$,
$\lambda_{1k}\lambda_{02} = \lambda_{0,k+1}\lambda_{11}$,

are here evidently also fulfilled and thus also here the regression is linear. A further
calculation will show that the scedasticity as given by the variance is of the second
degree in $z_1$, except for $n_1 = 0$, when it is linear. The coefficient of correlation
between $\xi_1$ and $\xi_2$ (and also between $z_1$ and $z_2$) is

$\rho = r^2\dfrac{n}{\sqrt{(n + n_1)(n + n_2)}}$.

But the skewness is, if $n_1$ and $n_2$ are not equal, not the same in both marginals.



We have indeed

$\beta_{10} = \dfrac{\lambda_{30}^2}{\lambda_{20}^3} = \dfrac{8}{n + n_1}$,

and

$\beta_{01} = \dfrac{\lambda_{03}^2}{\lambda_{02}^3} = \dfrac{8}{n + n_2}$.

Expansions for the computation of (46) have also been developed by Mr Larsson.


6. Another interesting correlation surface will be obtained if we solve the following Third Problem: To find the correlation of $z_1 = \dfrac{1}{N}\Sigma x_i^2$ and $z_2 = \dfrac{1}{N}\Sigma y_i$ if $x_i$, $y_i$
are N pairs taken at random from a normally distributed supply (x and y being
deviations from the mean of the supply). This surface will evidently be of Type III
in one margin and of normal type in the other. Its characteristic function is easily
obtained. Evidently



Si <r a <’YV 

1 »/ 1 
> + ‘'A + '-A 




(50 

), 

2 V 2 2 *2 8 °V 

,* “V _'l_ 



2 N^' 2 2V 

AT , ifcr,* 




X ‘ A *« 

(51). 


which gives U ^ 2 e 

If we write 

it is seen that U (t, t 2 ) = (1 - t x ) e^ Nr ~ f ^ Ta r T i (52) 


„ 1 N Nz 2 

^ ~ 2 cri* Zl> V ’ 


is the characteristic function of the correlation of f and tj. Hence we have in this 
case 

Vl- N I f 00 f°° J J /I - T w 'i f 1 + r “ r-— -. I - £W] i ~ r}W„ i 

l (£> v) “ (2V) 2 J _oo J - J^ Wl( ^ Wt ( 1 - l) ^ * L ^ m 

which function may easily be expanded in a series suited for numerical computation. 
So, for instance, we may write the characteristic function in the form 

U(r i, r t ) - e* T ' (1 - n) ‘ ? I ~ (f r*)* t,*( 1 - nT* (54), 

and it follows that if we put


fs (£, v ) 1 


*_i _ t _e 

■ p e * 2 


■(55), 


the correlation function of £ and rj can be expanded in the series 

oo 1 7jSk 

fit V) - S o Jj (i^)* /w(fc^) (68). 

which is a combined Charlier and Romanowsky series (with given coefficients). 





Using the same methods as in the previous sections we find the following values 
of the seminvariants : 

\oi = 0 (except when l — 2) 

Xw = N 1 ( 57 ). 

\h = 0 

\« = 0 (when 3) 

By the same methods as in the previous sections we further easily find that 

V( = 0, 

( 58 ). 

We thus see that the regression of $\eta$ on $\xi$ is constant (non-regression), and that
the regression of $\xi$ on $\eta$ is parabolic of the second degree. Similarly, the scedasticity
as expressed by the variance of $\eta$ in the $\xi$-arrays will be found to be linear and the
scedasticity as expressed by the variance of $\xi$ in the $\eta$-arrays to be parabolic of
the second degree.



ON THE PARENT POPULATION WITH INDEPENDENT
VARIATES WHICH GIVES THE MINIMUM VALUE
OF $\phi^2$ FOR A GIVEN SAMPLE.

By KARL PEARSON. 

(1) This paper arises from a very bad blunder made by me in the last issue of
Biometrika, Vol. XXIV. pp. 461-463. It has probably been noticed by others, and
I hasten to correct it, for I recognised my error as soon as the printed Journal was
in my hands.

My problem was the following : Given that a sample in the form of a bivariate 
contingency table has been drawn from a parent population, what is the best form 
of parent population to take on the assumption that the variates are not correlated 
in that population ? 

Clearly we ought to take that form which will cause the mean square contingency in the sample to be a minimum. We will denote this mean squared
contingency by $\phi^2$. What choice shall we make of the relative frequencies $p_s$ and
$q_t$ of the sth and tth categories of the two variates in the parent population in
order that we may have the maximum probability that the two variates in the
sample come from an uncorrelated parent population? The fallacious proof referred
to professed to show that $p_s$ and $q_t$ should have the values provided by the marginal
totals of the sample.

(2) Let us suppose we have a parent population classed according to two
independent variates and that $p_s$ is the probability of drawing an individual of the
sth category of the first variate, and $q_t$ the chance of drawing an individual of the
tth category of the second variate. Then the chance of an individual combining
both these categories being drawn will be $p_s q_t$.

Now, if we have a sample of size N, containing $n_{st}$ individuals in the sth-tth
category drawn from this population, the mean square contingency $\phi^2$ will be given by

$\phi^2 = \dfrac{1}{N}\,S_{s,t}\left\{\dfrac{(n_{st} - N p_s q_t)^2}{N p_s q_t}\right\}$ (i).

Here $\phi^2$ is a measure of deviation from the assumed independent variate population, and it is usual to take $N p_s = n_{s.}$ and $N q_t = n_{.t}$, where $n_{s.}$ and $n_{.t}$ are the
totals of the individuals occurring in the sample with the sth category of the first
and the tth category of the second variate. This is to assume that the sample
adequately represents the parent population as far as the totals in the various
categories reached. But if the parent population be unknown, and we are desirous





of determining whether the variates in the sample are or are not independent,
then it would appear that we ought to choose the quantities $p_s$ and $q_t$ so that we
have the greatest probability of their independence. In other words we ought to
choose $p_s$ and $q_t$ so that $\phi^2$ is a minimum.

Let us write $n_{st}/N = u_{st}$, so that $u_{st}$ is a relative frequency in the sample, and

$S_{s,t}(u_{st}) = 1$ (ii).

Then $1 + \phi^2 = S_{s,t}\left(\dfrac{u_{st}^2}{p_s q_t}\right)$ (iii),

while $S_s(p_s) = 1$, $S_t(q_t) = 1$ (iv).

We need to find a minimum value of $\phi^2$ in (iii), subject to the conditions (iv).
Let $\hat\phi^2$ represent this minimum value.

We have in the usual manner

$\delta\phi^2 = -S_s\left\{\dfrac{\delta p_s}{p_s^2}\,S_t\left(\dfrac{u_{st}^2}{q_t}\right)\right\} - S_t\left\{\dfrac{\delta q_t}{q_t^2}\,S_s\left(\dfrac{u_{st}^2}{p_s}\right)\right\}$,

$S_s(\delta p_s) = 0$, $S_t(\delta q_t) = 0$.

Or, by the aid of indeterminate multipliers $\lambda_1$ and $\lambda_2$, we find

$\dfrac{1}{p_s^2}\,S_t\left(\dfrac{u_{st}^2}{q_t}\right) = \lambda_1$, $\dfrac{1}{q_t^2}\,S_s\left(\dfrac{u_{st}^2}{p_s}\right) = \lambda_2$ (v).

Multiplying these respectively by $p_s$ and $q_t$ and then summing respectively for
s and t we have, by (iv),

$\lambda_1 = 1 + \hat\phi^2$, $\lambda_2 = 1 + \hat\phi^2$,

and accordingly, writing $a_s^2 = S_t\left(\dfrac{u_{st}^2}{q_t}\right)$, $b_t^2 = S_s\left(\dfrac{u_{st}^2}{p_s}\right)$,

both right-hand sides being positive quantities,

$a_s/\sqrt{1 + \hat\phi^2} = p_s$, $b_t/\sqrt{1 + \hat\phi^2} = q_t$ (v bis).

If the number of the categories of the first variate be m and of the second m',
we have m + m' equations, which are, however, owing to the relations (iv) not
independent. It is interesting to note that this dependence is illustrated by writing
(v) in the form

$p_s = S_t\left(\dfrac{u_{st}^2}{p_s q_t}\right)\Big/(1 + \hat\phi^2)$, $q_t = S_s\left(\dfrac{u_{st}^2}{p_s q_t}\right)\Big/(1 + \hat\phi^2)$,

or

$p_s = \dfrac{\text{contribution to } 1 + \hat\phi^2 \text{ of sth row}}{\text{sum of contributions to } 1 + \hat\phi^2 \text{ of all rows}}$,

$q_t = \dfrac{\text{contribution to } 1 + \hat\phi^2 \text{ of tth column}}{\text{sum of contributions to } 1 + \hat\phi^2 \text{ of all columns}}$ (vi).

These conditions on the values of $p_s$ and $q_t$ to give the minimum $\phi^2$ will not as
a rule be satisfied by taking $p_s = n_{s.}/N$ and $q_t = n_{.t}/N$, but they will be satisfied in the
case when there is complete independence in the sample itself, i.e. $n_{st} = n_{s.}n_{.t}/N$.





Equations (v) present considerable difficulties in solution in the general case, 
which it is to be hoped some competent mathematician will overcome. They 
appear to lead to very high order equations. 
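Although an algebraic solution is laborious, the conditions (v bis) lend themselves to a simple numerical treatment. The sketch below is not the method of this paper: it assumes that, for fixed q's, the p's proportional to $a_s$ minimise $1 + \phi^2$ (and likewise the q's proportional to $b_t$ for fixed p's), and alternates the two updates starting from the marginal totals. The function names, the tolerance and the iteration cap are illustrative choices, and every row and column total is assumed to be positive.

```python
import math


def one_plus_phi2(u, p, q):
    """1 + phi^2 of (iii) for relative frequencies u and chances p, q."""
    return sum(u[s][t] ** 2 / (p[s] * q[t])
               for s in range(len(p)) for t in range(len(q)))


def minimise_phi2(counts, tol=1e-14, max_iter=10000):
    """Alternate the two halves of (v bis); returns (p, q, minimum phi^2)."""
    total = float(sum(sum(row) for row in counts))
    u = [[x / total for x in row] for row in counts]
    m, n = len(u), len(u[0])
    p = [sum(row) for row in u]                               # marginal totals as start
    q = [sum(u[s][t] for s in range(m)) for t in range(n)]
    value = one_plus_phi2(u, p, q)
    for _ in range(max_iter):
        a = [math.sqrt(sum(u[s][t] ** 2 / q[t] for t in range(n))) for s in range(m)]
        p = [x / sum(a) for x in a]                           # p_s proportional to a_s
        b = [math.sqrt(sum(u[s][t] ** 2 / p[s] for s in range(m))) for t in range(n)]
        q = [x / sum(b) for x in b]                           # q_t proportional to b_t
        new_value = one_plus_phi2(u, p, q)
        converged = value - new_value < tol
        value = new_value
        if converged:
            break
    return p, q, value - 1.0


# Fourfold table of 1708 schoolboys (intelligence and athletic capacity).
schoolboys = [[581.25, 566.75], [209.25, 350.75]]
p_hat, q_hat, phi2_hat = minimise_phi2(schoolboys)
```

Each half-step cannot increase $1 + \phi^2$, so the sequence of values is monotone; for the schoolboy table it settles close to the figures obtained from the quintic in section (3).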

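We can, however, illustrate the matter on a simple case, that of a fourfold table.

        $n_{11}$      $n_{12}$      $Np_1$
        $n_{21}$      $n_{22}$      $Np_2$
        $Nq_1$     $Nq_2$     $N$

We have from (v)

$\dfrac{u_{11}^2}{q_1} + \dfrac{u_{12}^2}{q_2} = p_1^2(1 + \hat\phi^2)$, or $p_1 = \sqrt{\dfrac{u_{11}^2/q_1 + u_{12}^2/q_2}{1 + \hat\phi^2}}$,

$\dfrac{u_{21}^2}{q_1} + \dfrac{u_{22}^2}{q_2} = p_2^2(1 + \hat\phi^2)$, or $p_2 = \sqrt{\dfrac{u_{21}^2/q_1 + u_{22}^2/q_2}{1 + \hat\phi^2}}$,

$\dfrac{u_{11}^2}{p_1} + \dfrac{u_{21}^2}{p_2} = q_1^2(1 + \hat\phi^2)$, or $q_1 = \sqrt{\dfrac{u_{11}^2/p_1 + u_{21}^2/p_2}{1 + \hat\phi^2}}$,

$\dfrac{u_{12}^2}{p_1} + \dfrac{u_{22}^2}{p_2} = q_2^2(1 + \hat\phi^2)$, or $q_2 = \sqrt{\dfrac{u_{12}^2/p_1 + u_{22}^2/p_2}{1 + \hat\phi^2}}$ (vii).

Accordingly

$\dfrac{p_2^2}{p_1^2} = \dfrac{q_2 u_{21}^2 + q_1 u_{22}^2}{q_2 u_{11}^2 + q_1 u_{12}^2}$ (viii).

Substituting for $q_1$ and $q_2$ from the last two equations on the right, we have

$\left(\dfrac{p_2}{p_1}\right)^2 = \dfrac{u_{21}^2\sqrt{\dfrac{u_{12}^2}{p_1} + \dfrac{u_{22}^2}{p_2}} + u_{22}^2\sqrt{\dfrac{u_{11}^2}{p_1} + \dfrac{u_{21}^2}{p_2}}}{u_{11}^2\sqrt{\dfrac{u_{12}^2}{p_1} + \dfrac{u_{22}^2}{p_2}} + u_{12}^2\sqrt{\dfrac{u_{11}^2}{p_1} + \dfrac{u_{21}^2}{p_2}}}$ (ix).

Writing $p_2/p_1 = z$, we obtain on rationalising an equation of the 10th order to find
z, and since $p_1 + p_2 = 1$, theoretically the problem is solved, for the same process
may be repeated on the q's.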

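We can, however, shorten the process by using the $b_t$'s of Equation (v). For

$\left(\dfrac{p_2}{p_1}\right)^2 = \dfrac{u_{21}^2 b_2 + u_{22}^2 b_1}{u_{11}^2 b_2 + u_{12}^2 b_1}$,

where

$b_1^2 = \dfrac{u_{11}^2}{p_1} + \dfrac{u_{21}^2}{p_2}$, $b_2^2 = \dfrac{u_{12}^2}{p_1} + \dfrac{u_{22}^2}{p_2}$ (x).

Solve these equations for $\dfrac{1}{p_1}$ and $\dfrac{1}{p_2}$. We find

$\dfrac{1}{p_1} = \dfrac{b_1^2 u_{22}^2 - b_2^2 u_{21}^2}{u_{11}^2 u_{22}^2 - u_{12}^2 u_{21}^2}$, $\dfrac{1}{p_2} = \dfrac{b_2^2 u_{11}^2 - b_1^2 u_{12}^2}{u_{11}^2 u_{22}^2 - u_{12}^2 u_{21}^2}$,

or

$\dfrac{p_2}{p_1} = \dfrac{b_1^2 u_{22}^2 - b_2^2 u_{21}^2}{b_2^2 u_{11}^2 - b_1^2 u_{12}^2}$ (xi).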



Now write $b_1/b_2 = X$, then

$\dfrac{p_2}{p_1} = \dfrac{X^2 u_{22}^2 - u_{21}^2}{u_{11}^2 - X^2 u_{12}^2}$,

or

$p_1 = \dfrac{u_{11}^2 - X^2 u_{12}^2}{u_{11}^2 - u_{21}^2 - X^2(u_{12}^2 - u_{22}^2)}$ (xii).

This gives $p_1$ when X is known. Now, returning to Equation (viii) and using (x)
and (xi), we have

$\left(\dfrac{X^2 u_{22}^2 - u_{21}^2}{u_{11}^2 - X^2 u_{12}^2}\right)^2 = \dfrac{u_{21}^2 + u_{22}^2 X}{u_{11}^2 + u_{12}^2 X}$,

which on expansion leads to

$X^5 u_{12}^2 u_{22}^2(u_{22}^2 - u_{12}^2) + X^4(u_{22}^4 u_{11}^2 - u_{12}^4 u_{21}^2) - 2X^3 u_{12}^2 u_{22}^2(u_{21}^2 - u_{11}^2) - 2X^2 u_{11}^2 u_{21}^2(u_{22}^2 - u_{12}^2) + X(u_{21}^4 u_{12}^2 - u_{11}^4 u_{22}^2) + u_{11}^2 u_{21}^2(u_{21}^2 - u_{11}^2) = 0$ (xiii).

The appropriate root of X being found from (xiii), (xii) will then give $p_1$ and
$p_1$ being known $p_2 = 1 - p_1$. Hence by (viii) we have the ratio $q_2/q_1$*, and since
$q_2 + q_1 = 1$, we find $q_2$. Lastly, from $p_1$, $p_2$, $q_1$ and $q_2$ we can find $\hat\phi^2$, and so complete
the problem.


(3) Illustrations from Tetrachoric Tables. 

We may illustrate this first on the following example, showing the relation of
Intelligence to Athletic capacity in 1708 schoolboys. The decimals arise from
boys placed on the boundary lines.

                   “Intelligent”     “Slow Intelligent”
                   and above         and below             Totals
    Athletic        581.25            566.75                1148
    Non-Athletic    209.25            350.75                 560
    Totals          790.5             917.5                 1708

Treated as a fourfold table the correlation of Intelligence with Athletic Capacity
is .2035. But this result really measures the correlation within this particular
sample. We may inquire what is the probability of this as a sample:

(a) From a parent population with independent variates, determined by the
marginal totals of the sample itself.

(b) From the most probable independent variates' parent population.

To answer these questions we first rewrite the table in terms of u's, thus:

    $u_{11}$ = .3403,1030    $u_{12}$ = .3318,2085    $u_{1.}$ = .6721,3115
    $u_{21}$ = .1225,1171    $u_{22}$ = .2053,5714    $u_{2.}$ = .3278,6885
    $u_{.1}$ = .4628,2201    $u_{.2}$ = .5371,7799    Total = 1.0000,0000

* We have $X = b_1/b_2 = q_1/q_2$ by the last two equations of (vii), or

$q_1 = X/(1 + X)$, $q_2 = 1/(1 + X)$ (xiv).






(a) Here $u_{1.} = p_1$, $u_{2.} = p_2$, $u_{.1} = q_1$, $u_{.2} = q_2$, if we take the independent parent
population to have its probabilities determined by the marginal totals of the
sample. Call the resulting $\phi^2$, $\phi_1^2$. Then

$1 + \phi_1^2 = \dfrac{u_{11}^2}{u_{.1}u_{1.}} + \dfrac{u_{12}^2}{u_{.2}u_{1.}} + \dfrac{u_{21}^2}{u_{.1}u_{2.}} + \dfrac{u_{22}^2}{u_{.2}u_{2.}}$

= .3722,9068 + .3049,5454 + .0989,1020 + .2394,4251
= 1.0155,9793.

Hence $\phi_1^2$ = .0155,9793 and $\chi^2 = N\phi_1^2$ = 26.6413. The only constraint is the
total size of the sample, and in a second sample $u_{1.}$ and $u_{2.}$ would differ. Thus
we interpolate for our $\chi^2$ with n' = 4 in the Table* and we find P = .000,007, and
accordingly it is highly improbable that such a concentration of intelligence and
athletic capacity could have been drawn from a parent population in which the two
characters were independent provided that parent population had the same relative
proportions of intelligence and athletic capacity as are shown in the sample.
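The tabular interpolation can be compared with a direct evaluation. The sketch below assumes that entering the table with four cells and the single constraint of the total corresponds to three degrees of freedom, for which the upper tail of $\chi^2$ has the closed form $\mathrm{erfc}\sqrt{\chi^2/2} + \sqrt{2\chi^2/\pi}\,e^{-\chi^2/2}$. The function name is hypothetical.

```python
import math


def chi2_tail_three_df(chi2):
    """Upper-tail probability P for chi-square with three degrees of freedom."""
    half = chi2 / 2.0
    return math.erfc(math.sqrt(half)) + 2.0 * math.sqrt(half / math.pi) * math.exp(-half)


p_marginal = chi2_tail_three_df(26.6413)   # chi-square from the marginal totals
```

For $\chi^2$ = 26.6413 this gives a probability of about seven in a million, in agreement with the tabled value.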

But why should we limit possible parent populations having these two 
characters independent to this particular case ? Rather we ought to seek for the 
most likely population of independent variates from which the sample might have 
been drawn. 


(b) To solve this problem we must solve Equation (xiii) for our particular case,
and then use (xii) and (xiv) to find $p_1$ and $q_1$ which for the minimum value $\hat\phi^2$ we
will term $\hat p_1$ and $\hat q_1$. We have

$u_{11}^2$ = .1158,1110, $u_{12}^2$ = .1101,0508, $u_{21}^2$ = .0150,0912, $u_{22}^2$ = .0421,7156,
$u_{11}^4$ = .0134,1221, $u_{12}^4$ = .0121,2313, $u_{21}^4$ = .0002,2527, $u_{22}^4$ = .0017,7844,

whence we readily find, on substituting in (xiii),

$-.0003,15436X^5 + .0000,24005X^4 + .0009,36108X^3 + .0002,36167X^2 - .0005,40811X - .0001,75216 = 0$,

or in a more convenient form

$31.5436X^5 - 2.4005X^4 - 93.6108X^3 - 23.6167X^2 + 54.0811X + 17.5216 = 0$.

This equation has a positive root between 0 and 1 approaching the latter value.
Two approximations by Newton's method gave

X = .864,7955,

whence $\hat q_1$ = .4637,4817, $\hat q_2$ = .5362,5183,

since by (xiv) $\hat q_1/\hat q_2 = X$, or $\hat q_1 = X/(1 + X)$.

From (xii) we find

$\hat p_1$ = .6694,0856, $\hat p_2$ = .3305,9144.

Thence we have $1 + \hat\phi^2$ = 1.0155,6244,

and accordingly $\hat\chi^2$ = 1708 × .0155,6244 = 26.5806.

This marks a slight increase of probability on the parent population based on
the marginal totals, but the increase would have in this case small importance for
practical statistics.
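The arithmetic of this example can be retraced as follows. The sketch builds the six coefficients of (xiii) from the relative frequencies of a fourfold table, refines the root by Newton's rule with Horner's scheme for the polynomial and its derivative, and then applies (xiv) and (xii). The function names, the starting value 1.0 and the number of Newton steps are illustrative choices; the coefficient formulae are those read from (xiii).

```python
def quintic_coefficients(u11, u12, u21, u22):
    """Coefficients of (xiii), highest power of X first."""
    a, b, c, d = u11 ** 2, u12 ** 2, u21 ** 2, u22 ** 2
    return [b * d * (d - b),
            a * d * d - b * b * c,
            -2.0 * b * d * (c - a),
            -2.0 * a * c * (d - b),
            b * c * c - a * a * d,
            a * c * (c - a)]


def newton_root(coeffs, x0, steps=50):
    """Newton's rule; value and derivative are evaluated together by Horner's scheme."""
    x = x0
    for _ in range(steps):
        f, df = 0.0, 0.0
        for coef in coeffs:
            df = df * x + f
            f = f * x + coef
        x -= f / df
    return x


def p1_from_x(x, u11, u12, u21, u22):
    """Equation (xii)."""
    a, b, c, d = u11 ** 2, u12 ** 2, u21 ** 2, u22 ** 2
    return (a - x * x * b) / (a - c - x * x * (b - d))


u = [v / 1708.0 for v in (581.25, 566.75, 209.25, 350.75)]
coeffs = quintic_coefficients(*u)
x_root = newton_root(coeffs, 1.0)
q1_hat = x_root / (1.0 + x_root)          # equation (xiv)
p1_hat = p1_from_x(x_root, *u)
```

Multiplying `coeffs` by $-10^5$ reproduces the "more convenient form" of the quintic, and the root found from the starting value 1.0 is the one lying just below unity.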


* Tables for Biometricians and Statisticians, Part I, p. 26.





I will take as a second illustration of the method the following fourfold table
giving a tetrachoric correlation between .3 and .4.

                A       Not A     Totals
    B          270        88        358
    Not B       33        33         66
    Totals     303       121        424

To obtain an accurate equation for X, ten figures were taken so that the
relative frequencies were as follows:

    $u_{11}$ = .6367,924,528    $u_{12}$ = .2075,471,698    $u_{1.}$ = .8443,396,226
    $u_{21}$ = .0778,301,887    $u_{22}$ = .0778,301,887    $u_{2.}$ = .1556,603,774
    $u_{.1}$ = .7146,226,415    $u_{.2}$ = .2853,773,585    Total = 1.0000,000,000

Here

$u_{11}^2$ = .4055,046,279, $u_{12}^2$ = .0430,758,277, $u_{21}^2 = u_{22}^2$ = .0060,575,383.

Whence the ordinary $\phi^2$ is given by

$1 + \phi^2$ = 1.0416,404,766,

$\chi^2$ = 424 × .0416,404,766 = 17.66,
and P = .000,531.

This results from assuming the parent population has the class probabilities given
by the marginal totals of the sample itself. We now leave the marginal frequencies
to be found so that they give a minimum $\hat\phi^2$; we need the following values:

$u_{11}^4$ = .1644,340,032, $u_{12}^4$ = .0018,555,269, $u_{21}^4$ = .0000,366,931 = $u_{22}^4$.

Hence we obtain the quintic equation for X,

$9,659,306X^5 - 3,639,581X^4 - 208,459,178X^3 - 181,860,477X^2 + 994,484,656X + 981,190,690 = 0$.

This equation has a positive root between 2 and 3, and another between 4 and 5.
The latter gives a high value of $\phi^2$. The former root is

X = 2.442,129,416,

and as this equals $\hat q_1/\hat q_2$, we have

$\hat q_1$ = .709,482,161, $\hat q_2$ = .290,517,839.




Proceeding now to find the ratio $\hat p_2/\hat p_1$, we have from (ix)

$\left(\dfrac{\hat p_2}{\hat p_1}\right)^2 = \dfrac{.0208,508,308}{.5107,013,738} = .0408,278,338$,

which gives $\hat p_1$ = .831,905,931, $\hat p_2$ = .168,094,069.

These values of the p's and q's provide

$1 + \hat\phi^2$ = 1.040,104,809, and $\hat\chi^2 = 424\hat\phi^2$ = 17.00,
giving P = .000,707.

The change, as in the previous case, increases the value of P, but only slightly.
Thus here again the parent population of maximum probability seems scarcely
worth the labour of computing.

Lastly, I took a third case, in which the tetrachoric correlation was very low, 
namely the distribution of Boys and Girls by their hair colour, in two categories, 
Dark and Light. 

The table is as follows:

                      Hair Colour.

    Sex         Dark      Light     Totals
    Girls        529       176        705
    Boys         442       148        590
    Totals       971       324       1295

This gives us for the relative frequencies:

    $u_{11}$ = .4084,9420,84    $u_{12}$ = .1359,0733,59    $u_{1.}$ = .5444,0154,43
    $u_{21}$ = .3413,1274,12    $u_{22}$ = .1142,8571,43    $u_{2.}$ = .4555,9845,57
    $u_{.1}$ = .7498,0695,06    $u_{.2}$ = .2501,9304,94    Total = 1.0000,0000,00

Whence with marginal totals we find

$1 + \phi^2$ = 1.0000,0190,8 or $\chi^2$ = .0000,0190,8 × 1295 = .0024,7086,

which gives a high probability that the hair colour in these groups of boys and
girls is independent of sex.

We now turn to the most probable parent population. It is necessary to work 
to a large number of decimal places, because the population of maximum likeli- 
hood is so close to that of the marginal totals. We have 

$u_{11}^2$ = .1668,6751,80, $u_{11}^4$ = .0278,4476,86,
$u_{21}^2$ = .1164,9438,72, $u_{21}^4$ = .0135,7094,22,
$u_{12}^2$ = .0184,7080,40, $u_{12}^4$ = .0003,4117,08,
$u_{22}^2$ = .0130,6122,44, $u_{22}^4$ = .0001,7059,56.





From these values we deduce the quintic

$130.506,820X^5 + 1127.761,551X^4 - 2430.516,818X^3 - 2,1031.503,410X^2 + 1,1302.055,15X + 9,7920.980,050 = 0$.

Localising a root near three, we find by a double Newtonian approximation

X = 2.9969,2335.

It may be doubted whether we can approximate closer to the root without carrying
the coefficients of the quintic to more places of figures.

From the relation $\hat q_1/\hat q_2 = X$ we find

$\hat q_1$ = .7498,0756, $\hat q_2$ = .2501,9244.

Then from Equation (viii) we deduce

$\hat p_1$ = .5444,0146, $\hat p_2$ = .4555,9854.

We see from these results how little the p's and q's have been modified by
seeking the most probable parent population.

Substituting in the expression for $1 + \hat\phi^2$, we have

$1 + \hat\phi^2$ = 1.0000,0190,8

exactly as before, or $\hat\chi^2$ = .0024,7086.

Probably the last figure in $1 + \hat\phi^2$ is untrustworthy, and it is not possible without
more labour than the matter is worth to distinguish between the marginal totals'
parent population and the most probable parent populations.

To judge by the results of the three tables here discussed, it is not likely that, 
with the restricted freedom of a fourfold table, we shall obtain a substantially 
higher degree of the probability that two series come from the same parent 
population, when we take that population to be the most probable population 
rather than base it on the marginal totals. 

(4) Theory for 2 × n Tables.

In the case of Biserial Tables, where the two series are both supposed to have
totals due only to random sampling, the equations become harder of solution; and
thus far I have only reached such solution by a method of approximation. We
will suppose the tables reduced to the form:

    $u_{11}$   $u_{12}$   ...   $u_{1s}$   ...   $u_{1t}$   ...   $u_{1n}$   $p_1$
    $u_{21}$   $u_{22}$   ...   $u_{2s}$   ...   $u_{2t}$   ...   $u_{2n}$   $p_2$
    $q_1$    $q_2$    ...   $q_s$    ...   $q_t$    ...   $q_n$    1

where $p_1$, $p_2$, $q_1$, $q_2$ ... $q_s$ ... $q_t$ ... $q_n$ are the chances deduced from the parent
population which we intend to choose so as to give the minimum $\phi^2$ to the table.
Usually they are found as we know from the sums of the rows and columns. We
have supposed the cell frequencies reduced to relative frequencies, their total being
unity. $\hat\phi^2$ is the minimum mean square contingency.




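By the second equation of (v) we have for the minimum

$q_s = \sqrt{\dfrac{u_{1s}^2}{p_1} + \dfrac{u_{2s}^2}{p_2}}\cdot\dfrac{1}{\sqrt{1 + \hat\phi^2}}$, $q_t = \sqrt{\dfrac{u_{1t}^2}{p_1} + \dfrac{u_{2t}^2}{p_2}}\cdot\dfrac{1}{\sqrt{1 + \hat\phi^2}}$.

It follows therefore that

$q_s = \dfrac{\sqrt{u_{1s}^2 + \dfrac{p_1}{p_2}u_{2s}^2}}{S_{t=1}^{n}\sqrt{u_{1t}^2 + \dfrac{p_1}{p_2}u_{2t}^2}}$ (xvi),

since $S_{s=1}^{n}(q_s) = 1$.

But from the first equation of (v)

$p_1^2(1 + \hat\phi^2) = S_{s=1}^{n}\left(\dfrac{u_{1s}^2}{q_s}\right)$, $p_2^2(1 + \hat\phi^2) = S_{s=1}^{n}\left(\dfrac{u_{2s}^2}{q_s}\right)$.

Let us write $p_1/p_2 = Y$; then

$Y^2 = S_{s=1}^{n}\left(\dfrac{u_{1s}^2}{q_s}\right)\Big/ S_{s=1}^{n}\left(\dfrac{u_{2s}^2}{q_s}\right)$.

Now substitute the value of $q_s$ in (xvi) and the denominator will divide out
and we have

$Y^2 = S_{s=1}^{n}\dfrac{u_{1s}^2}{\sqrt{u_{1s}^2 + Y u_{2s}^2}}\Big/ S_{s=1}^{n}\dfrac{u_{2s}^2}{\sqrt{u_{1s}^2 + Y u_{2s}^2}}$ (xvii).

This is the equation to find Y and on its solution we shall have $p_1$ and $p_2$ known,
and the q's may then be found from (xvi).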
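A minimal numerical sketch of this step follows. It rewrites (xvii) as $F(Y) = S\{(u_{1s}^2 - Y^2 u_{2s}^2)/\sqrt{u_{1s}^2 + Y u_{2s}^2}\} = 0$ and applies plain Newton iteration to Y itself, with the derivative worked out term by term, rather than the proportional correction used in the text; the function names and the fixed number of steps are assumptions made for illustration. The q's are then formed from (xvi).

```python
import math


def solve_y(u1, u2, y0, steps=50):
    """Newton iteration on F(Y) = sum (u1^2 - Y^2 u2^2) / sqrt(u1^2 + Y u2^2)."""
    y = y0
    for _ in range(steps):
        f, df = 0.0, 0.0
        for a, b in zip(u1, u2):
            w = a * a + y * b * b            # quantity under the square root
            g = a * a - y * y * b * b        # numerator of the s-th term
            f += g / math.sqrt(w)
            df += -2.0 * y * b * b / math.sqrt(w) - 0.5 * b * b * g / (w * math.sqrt(w))
        y -= f / df
    return y


def chances_from_y(u1, u2, y):
    """p_1, p_2 from Y = p_1/p_2 and the q's from (xvi)."""
    roots = [math.sqrt(a * a + y * b * b) for a, b in zip(u1, u2)]
    total = sum(roots)
    return y / (1.0 + y), 1.0 / (1.0 + y), [x / total for x in roots]


# The fourfold schoolboy table treated as a 2 x n table with n = 2.
row1 = [581.25 / 1708.0, 566.75 / 1708.0]
row2 = [209.25 / 1708.0, 350.75 / 1708.0]
y_start = sum(row1) / sum(row2)              # ratio of the marginal totals
y_hat = solve_y(row1, row2, y_start)
p1_hat, p2_hat, q_hat = chances_from_y(row1, row2, y_hat)
```

Starting from the ratio of the marginal totals, a few iterations suffice in this example, and the resulting chances agree with those found from the quintic to about four decimal places.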


(xvii) may be written

$0 = S_{s=1}^{n}\dfrac{u_{1s}^2 - Y^2 u_{2s}^2}{\sqrt{u_{1s}^2 + Y u_{2s}^2}}$ (xviii).

1 A TU/ IUIV f MV ff ** W¥V4« V ~ i ~ — t «••»« I I f JLM.A f| 

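Now assume $Y_0$ is an approximate value of Y, and take $Y = Y_0(1 + \epsilon)$, using
Newton's rule to determine an approach to $\epsilon$. We have

$\epsilon = \dfrac{2\,S_{s=1}^{n}\dfrac{u_{1s}^2 - Y_0^2 u_{2s}^2}{\sqrt{u_{1s}^2 + Y_0 u_{2s}^2}}}{4\,S_{s=1}^{n}\dfrac{Y_0^2 u_{2s}^2}{\sqrt{u_{1s}^2 + Y_0 u_{2s}^2}} + S_{s=1}^{n}\dfrac{Y_0 u_{2s}^2(u_{1s}^2 - Y_0^2 u_{2s}^2)}{(u_{1s}^2 + Y_0 u_{2s}^2)^{3/2}}}$.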

Hence after some transformation we have 


n 2 __ VSL, 2 
2 $ J 
**=i 7ow& 2 





Now suppose $Y_0$ to be the value due to $p_1$ and $p_2$, or $Y_0 = p_1/p_2$, then
substituting for $Y_0$ we find after some reductions

* o” / “1** WiA /{ u i* . 

— ■■ (->• 

Uu U<te Ujg* 

s~ n t 2 s—n ni a 8 

S -ff ^■■■■■, + 3 8 /£— - T 

* = ! M.® + WfcV 

\Pl Pi/ \Pl Pi) 


Accordingly we take suitable values for $p_1$ and $p_2$, say those of the actual
marginal totals, from these we compute $u_{1s}^2/p_1$, $u_{1s}^2/p_1^2$, $u_{2s}^2/p_2$ and $u_{2s}^2/p_2^2$, and then
form the sums indicated in (xix). Knowing $\epsilon$ we determine another $p_1'$ and $p_2'$ from

$$p_1'/p_2' = (1+\epsilon)\, p_1/p_2,$$

together with $p_1' + p_2' = 1$. If $\epsilon$ does not come out adequately small we must now
repeat the process using $p_1'$, $p_2'$ for $p_1$, $p_2$. Next the $q$'s are found from (xvi), which
may be written

$$q_s = \sqrt{\frac{u_{1s}^2}{p_1} + \frac{u_{2s}^2}{p_2}} \Bigg/ \mathop{S}\limits_{s=1}^{n} \sqrt{\frac{u_{1s}^2}{p_1} + \frac{u_{2s}^2}{p_2}} \qquad \text{(xvi)}^{bis},$$

where the values of $u_{1s}^2/p_1$ and $u_{2s}^2/p_2$ will already have been calculated.
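The iteration just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it applies Newton's rule to the form (xviii) of the equation for $Y$, repeats the step until $\epsilon$ is negligible rather than stopping after one approximation, and uses the anaemia counts from the Illustration later in the paper as input. The function names `f` and `newton_epsilon` are hypothetical, and the figures it produces need not agree to the last digit with those printed in the paper.

```python
from math import sqrt

# Anaemia-by-age counts from the Illustration (rows: non-anaemic, anaemic).
counts = [
    [19, 23, 19, 26, 14, 13, 17],
    [34, 40, 28, 40, 33, 34, 27],
]
N = sum(sum(row) for row in counts)
u1 = [c / N for c in counts[0]]  # u_1s
u2 = [c / N for c in counts[1]]  # u_2s


def f(Y):
    """Right-hand side of (xviii): S (u1^2 - Y^2 u2^2) / sqrt(u1^2 + Y u2^2)."""
    return sum((a * a - Y * Y * b * b) / sqrt(a * a + Y * b * b)
               for a, b in zip(u1, u2))


def newton_epsilon(Y0):
    """One step of Newton's rule on (xviii); returns eps in Y = Y0 * (1 + eps)."""
    denominator = sum(
        4.0 * Y0 ** 2 * b * b / sqrt(a * a + Y0 * b * b)
        + Y0 * b * b * (a * a - Y0 ** 2 * b * b) / (a * a + Y0 * b * b) ** 1.5
        for a, b in zip(u1, u2)
    )
    return 2.0 * f(Y0) / denominator


# Start from the marginal totals, i.e. Y0 = p1 / p2.
Y = sum(u1) / sum(u2)
for _ in range(20):
    eps = newton_epsilon(Y)
    Y *= 1.0 + eps
    if abs(eps) < 1e-14:
        break

p1 = Y / (1.0 + Y)
p2 = 1.0 - p1

# The q's follow from (xvi); T is the sum in its denominator.
roots = [sqrt(a * a / p1 + b * b / p2) for a, b in zip(u1, u2)]
T = sum(roots)
q = [r / T for r in roots]
phi2_min = T * T - 1.0

print("p1, p2 =", p1, p2)
print("minimum phi^2 =", phi2_min, " chi^2 =", N * phi2_min)
```

Iterating to convergence is a choice made here for convenience; the text itself stops as soon as $\epsilon$ is judged adequately small.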

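We can proceed to find $\phi^2$ from (xvi)$^{bis}$ directly without determining the $q$'s. Let

$$T = \mathop{S}\limits_{s=1}^{n} \sqrt{\frac{u_{1s}^2}{p_1} + \frac{u_{2s}^2}{p_2}},$$

then clearly

$$T^2 q_s = \frac{u_{1s}^2}{p_1 q_s} + \frac{u_{2s}^2}{p_2 q_s};$$

hence summing for $s$ we have, since $S(q_s) = 1$,

$$T^2 = 1 + \phi^2,$$

or

$$\phi^2 = T^2 - 1 \qquad \text{(xx)},$$

leading to $\chi^2 = N(T^2 - 1)$, where $N$ is the total frequency of the table.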
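We may note that this result can be generalised. By (v) we have

$$q_t = \left(\mathop{S}\limits_{s}\left(\frac{u_{st}^2}{p_s}\right)\right)^{1/2} \cdot \frac{1}{\sqrt{1+\phi^2}},$$

hence, since $S(q_t) = 1$,

$$\sqrt{1+\phi^2} = \mathop{S}\limits_{t}\left(\mathop{S}\limits_{s}\left(\frac{u_{st}^2}{p_s}\right)\right)^{1/2}.$$

Thus

$$q_t = \left(\mathop{S}\limits_{s}\left(\frac{u_{st}^2}{p_s}\right)\right)^{1/2} \Big/ T \qquad \text{(xxi)},$$

where

$$T = \left(\mathop{S}\limits_{s}\left(\frac{u_{s1}^2}{p_s}\right)\right)^{1/2} + \left(\mathop{S}\limits_{s}\left(\frac{u_{s2}^2}{p_s}\right)\right)^{1/2} + \ldots + \left(\mathop{S}\limits_{s}\left(\frac{u_{st}^2}{p_s}\right)\right)^{1/2} = \mathop{S}\limits_{t}\left(\mathop{S}\limits_{s}\left(\frac{u_{st}^2}{p_s}\right)\right)^{1/2}.$$

Thus

$$q_t = \mathop{S}\limits_{s}\left(\frac{u_{st}^2}{p_s q_t}\right) \Big/ T^2,$$

and $1 = (1+\phi^2)/T^2$,

by summing for $t$. Accordingly, as before,

$$\phi^2 = T^2 - 1 \qquad \text{(xxii)}.$$

It is thus only necessary to find the $p$'s (or the $q$'s) in order to determine the
minimum $\phi^2$.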
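For a table with any number of rows and columns, the two conditions of (v) can also be satisfied numerically by applying them in turn until the $p$'s and $q$'s stop changing. The sketch below does that; it is a swapped-in fixed-point scheme, not the Newton procedure of the text, and the function name `minimise_phi2` and the number of sweeps are assumptions made for the illustration.

```python
from math import sqrt


def minimise_phi2(table, sweeps=500):
    """Alternate the two conditions of (v), starting from the marginal totals.

    Returns (p, q, phi2), where phi2 = S(u_st^2 / (p_s q_t)) - 1.
    """
    n = sum(sum(row) for row in table)
    u_sq = [[(c / n) ** 2 for c in row] for row in table]  # u_st^2
    rows, cols = len(u_sq), len(u_sq[0])
    p = [sum(row) / n for row in table]
    q = [sum(col) / n for col in zip(*table)]
    for _ in range(sweeps):
        # q_t proportional to sqrt(S_s u_st^2 / p_s); dividing by the sum makes S(q_t) = 1
        root_q = [sqrt(sum(u_sq[s][t] / p[s] for s in range(rows))) for t in range(cols)]
        total_q = sum(root_q)
        q = [r / total_q for r in root_q]
        # p_s proportional to sqrt(S_t u_st^2 / q_t); dividing by the sum makes S(p_s) = 1
        root_p = [sqrt(sum(u_sq[s][t] / q[t] for t in range(cols))) for s in range(rows)]
        total_p = sum(root_p)
        p = [r / total_p for r in root_p]
    phi2 = sum(u_sq[s][t] / (p[s] * q[t]) for s in range(rows) for t in range(cols)) - 1.0
    return p, q, phi2


anaemia = [
    [19, 23, 19, 26, 14, 13, 17],
    [34, 40, 28, 40, 33, 34, 27],
]
p, q, phi2 = minimise_phi2(anaemia)
print(p, phi2)
```

Each half-step is the exact minimiser of $S(u_{st}^2/(p_s q_t))$ over one set of chances with the other held fixed, so the value of $\phi^2$ cannot increase from sweep to sweep.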


(5) Illustration. 

I take the following data for Anaemia in the boys of a small L.C.C. School — 
one of the poorest schools of London — see Report of Medical Officer, 1909. The 
ages of the boys were 7 to 13.


Ages           7    8    9   10   11   12   13   Totals

Non-Anaemic   19   23   19   26   14   13   17     131
Anaemic       34   40   28   40   33   34   27     236

Totals        53   63   47   66   47   47   44     367


We take as our hypothesis the supposition that there is no relation of Anaemia 
to the age of the boys. Now this is a sample from a population of boys frequenting 
schools in districts where lack of employment, improvidence and drink are wide- 
spread. Assuming an indefinitely large parent population, we do not know the 
proportion of anaemic and non-anaemic boys in that population, nor do we know 
what is the exact distribution of ages. Hitherto we have assumed these to be 
given by the marginal totals. 


Let us take $p_1$ to be the chance of drawing a non-anaemic boy, $p_2$ of drawing
an anaemic boy from the parent population. Let the chance of drawing boys of
ages 7, 8, 9, ... 13 be $q_1, q_2, q_3, \ldots q_7$ respectively. Then the deviation of our sample
from the parent population of supposed independent categories will be measured
by $\phi^2$, where

$$1 + \phi^2 = S\left(\frac{n_{st}^2}{N^2 p_s q_t}\right),$$

where $s$ is 1 or 2, giving the blood class, and $t$ ranges from 1 to 7. $n_{st}$ is the cell
frequency and $N$ = size of sample = 367. It is convenient to write $u_{st} = n_{st}/N$, so
that

$$\phi^2 = S\left(\frac{u_{st}^2}{p_s q_t}\right) - 1 \qquad \text{(xxiii)}.$$

If as usual we put for $p_s$ and $q_t$ the values deduced from the sample we find the
table may be written:

Age             7            8            9           10           11           12           13            p

Non-Anaemic  0.0517,7112  0.0626,7030  0.0517,7112  0.0708,4469  0.0381,4714  0.0354,2234  0.0463,2153  0.3569,4823
Anaemic      0.0926,4305  0.1089,9183  0.0762,9428  0.1089,9182  0.0899,1826  0.0926,4305  0.0735,6948  0.6430,5177

q            0.1444,1417  0.1716,6213  0.1280,6540  0.1798,3651  0.1280,6540  0.1280,6540  0.1198,9101  1.0000,0000

Using the $p$'s and $q$'s of this table, hereafter to be written $\bar{p}$ and $\bar{q}$, we find

$$1 + \phi^2 = 1.0083,7311,$$

and $\chi^2 = \phi^2 \times 367 = 3.0729$.
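The value of $1 + \phi^2$ for the marginal-totals population can be checked directly from the cell frequencies, since $u_{st}^2/(\bar{p}_s \bar{q}_t) = n_{st}^2/(n_{s\cdot}\, n_{\cdot t})$. A short Python check follows; the function name is hypothetical and the counts are those of the anaemia table above.

```python
# Anaemia-by-age counts (rows: non-anaemic, anaemic; columns: ages 7 to 13).
counts = [
    [19, 23, 19, 26, 14, 13, 17],
    [34, 40, 28, 40, 33, 34, 27],
]


def phi2_from_marginals(table):
    """Mean square contingency with p and q taken from the marginal totals."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    # 1 + phi^2 = S( n_st^2 / (row total * column total) ); N cancels out
    one_plus_phi2 = sum(
        table[s][t] ** 2 / (row_totals[s] * col_totals[t])
        for s in range(len(row_totals))
        for t in range(len(col_totals))
    )
    return n, one_plus_phi2 - 1.0


N, phi2 = phi2_from_marginals(counts)
chi2 = N * phi2
print(N, 1.0 + phi2, chi2)
```

Run on these counts, the sketch gives $N = 367$ and values of $1+\phi^2$ and $\chi^2$ that agree with the printed 1.0083,7311 and 3.0729 to the figures quoted.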

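We now proceed to compute $\epsilon$ from formula (xix). As we have to find

$$\frac{u_{1s}^2}{p_1} + \frac{u_{2s}^2}{p_2}, \qquad \left(\frac{u_{1s}^2}{p_1} + \frac{u_{2s}^2}{p_2}\right)^{1/2}, \qquad \left(\frac{u_{1s}^2}{p_1} + \frac{u_{2s}^2}{p_2}\right)^{3/2},$$

and $u_{1s}^2/p_1^2$, $u_{2s}^2/p_2^2$, for each column, the process is somewhat laborious.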

I give here merely the final sums, in case any reader cares to check the 
arithmetic. We have 



•0101,7747, 


Hence 



-0101,7747 
e * 4 , 973l,6160 


•9730,6644, 


1-0000,3172. 

•0040,9296. 








We have for the approximate minimum $\phi^2$,

$$1 + \phi^2 = T^2 = 1.0083,4600,$$

or $\phi^2 = 0.0083,4600$,

and $\chi^2 = 3.0630$.

In this particular case we certainly have made a reduction in the $\phi^2$ and $\chi^2$, but
not one of any practical importance. The $\epsilon$ is already so small that it does not
seem that we should gain much by a second approximation.

It cannot, however, be asserted that in other cases the parent population of 
maximum probability may not differ much more considerably than the present 
does from the parent population as represented by the marginal totals. 

We want far more experience of what differences can arise between the two 
ways of approaching the subject. In particular we need the aid of a more
competent mathematician than the present writer to deal with the general case of $\phi^2$
for an a × n table. As far as the present very limited data for two very special
types of contingency tables reach, we might conclude, but hardly safely, that we 
shall not make large errors if we replace the most probable independent variates’ 
population by a marginal totals’ parent population. 



THE SKULLS FROM EXCAVATIONS AT DUNSTABLE, 

BEDFORDSHIRE. 

By DORIS DINGWALL, M.A., B.Sc., and MATTHEW YOUNG, D.Sc., M.D.

Introduction.

With the kind permission of the Conservators of the Dunstable Downs and the 
Stint Holders, the University College and Hospital Anthropological Society, in 
collaboration with T. W. Bagshawe, Esq., F.S.A., undertook the excavation of the 
most northerly of a group of bell barrows known as the Five Knolls, situated three- 
quarters of a mile west of Dunstable, Bedfordshire. The work was begun in 1925 
and was carried on during the four succeeding seasons 1925 — 1929*. The skulls 
are preserved in the Institute of Anatomy, University College, London. 

The barrow is from 50 to 60 ft. in diameter and the primary burial was found to 
be nearly central. It consisted of the skeleton of a woman in the crouched position 
lying on the right side in an oval cist cut in the chalk. This was surrounded by 
a flat-bottomed ditch possibly used for ceremonial purposes and belongs most 
probably to the Early Bronze Age. 

The secondary burials were of cremated bones covered by an inverted urn of 
the Middle Bronze Age type, and a collection of burnt bones in a shallow hollow. 

The tertiary burials were scattered without any clearly defined plan over the 
southern half of the barrow and the adjacent surface of the downs. It is with these 
tertiary burials that this paper is concerned. 

The dating of the skeletons is difficult because they were so close to the surface 
of the mound that it is unsafe to attribute them to the same date as that of the 
objects found in the same layer. The stratum contained a brooch of La Tène III
type (probably 100-50 B.C.) and a gilded bronze buckle with iron tang of probably
the 5th century A.D. Besides these there were various objects of Roman, post-
Roman and Saxon date.

Mr Dunning and Dr Wheeler† have discussed the archaeological evidence in
detail and have come to the conclusion that the "objects associated with the
burials suggest the 5th or 6th century A.D. for the date of the whole group."
Whether their conjecture that the series of skeletons "represents part of a Saxon
raiding party which had been worsted by the local inhabitants and summarily
executed" is a correct one will be discussed when the analysis of the cranial
measurements has been presented. During the excavation there were clear

* For a full report see the Archaeol. Journ. Vol. LXXXVIII. 1931, pp. 193-217.
† Op. cit. pp. 205-210.



TABLE I.

Showing the Means and Variabilities of the Characters in the Male series of Skulls.

Characters        No.   Means              σ                V

C*                25    1524.80 ± 21.32    106.59 ± 15.07    6.99 ± 0.99
L                 52     184.92 ± 0.81       5.84 ± 0.57     3.16 ± 0.31
F                 51     182.70 ± 0.91       6.50 ± 0.64     3.56 ± 0.35
B                 52     145.42 ± 0.73       5.28 ± 0.52     3.63 ± 0.36
B'                50      98.66 ± 0.72       5.07 ± 0.51     5.14 ± 0.51
H'                50     135.58 ± 0.65       4.63 ± 0.46     3.41 ± 0.34
OH                40     115.25 ± 0.49       3.32 ± 0.35     2.88 ± 0.30
LB                48     102.67 ± 0.63       4.34 ± 0.44     4.23 ± 0.43
Q                 44     317.30 ± 1.44       9.54 ± 1.02     3.01 ± 0.32
Q'                44     321.25 ± 1.49       9.91 ± 1.06     3.08 ± 0.33
S                 50     374.10 ± 1.97      13.91 ± 1.39     3.72 ± 0.37
S1                50     128.86 ± 0.67       4.76 ± 0.48     3.69 ± 0.37
S2                51     127.24 ± 1.35       9.63 ± 0.95     7.57 ± 0.75
S3                52     117.92 ± 0.93       6.71 ± 0.66     5.69 ± 0.56
S1'               50     112.18 ± 0.58       4.09 ± 0.41     3.65 ± 0.37
S2'               51     112.74 ± 0.99       7.06 ± 0.70     6.26 ± 0.62
S3'               52      97.27 ± 0.76       5.48 ± 0.54     5.63 ± 0.55
U                 51     527.10 ± 1.98      14.13 ± 1.40     2.68 ± 0.27
GH                32     118.03 ± 0.93       5.24 ± 0.66     4.44 ± 0.56
G'H               34      70.97 ± 0.64       3.74 ± 0.45     5.27 ± 0.64
GB                33      96.73 ± 0.78       4.49 ± 0.55     4.64 ± 0.57
J                 19     138.11 ± 1.06       4.61 ± 0.75     3.34 ± 0.54
NH'               32      50.69 ± 0.53       2.98 ± 0.37     5.88 ± 0.74
NH, R             34      51.65 ± 0.43       2.50 ± 0.30     4.84 ± 0.59
NH, L             34      51.35 ± 0.46       2.71 ± 0.33     5.28 ± 0.64
NB                34      25.24 ± 0.26       1.54 ± 0.19     6.10 ± 0.74
O1, R             31      42.81 ± 0.28       1.57 ± 0.20     3.67 ± 0.47
O1, L             34      42.44 ± 0.28       1.65 ± 0.20     3.89 ± 0.47
O2, R             31      33.42 ± 0.33       1.86 ± 0.24     5.57 ± 0.71
O2, L             34      33.47 ± 0.33       1.93 ± 0.23     5.77 ± 0.70
O1', R            27      40.74 ± 0.25       1.32 ± 0.18     3.24 ± 0.44
G1                20      51.10 ± 0.51       2.77 ± 0.36     5.42 ± 0.71
G1'               27      46.52 ± 0.47       2.44 ± 0.33     5.25 ± 0.71
G2                36      40.94 ± 0.43       2.56 ± 0.30     6.25 ± 0.74
GL                36      97.81 ± 0.79       4.74 ± 0.56     4.85 ± 0.57
fml               48      36.71 ± 0.33       2.28 ± 0.23     6.21 ± 0.63
fmb               43      30.40 ± 0.32       2.10 ± 0.23     6.91 ± 0.75
100 B/L           52      78.69 ± 0.47       3.40 ± 0.33     4.32 ± 0.42
100 B/F           51      79.83 ± 0.51       3.66 ± 0.36     4.58 ± 0.45
100 H'/L          50      73.40 ± 0.43       3.03 ± 0.30     4.13 ± 0.41
100 B/H'          50     107.28 ± 0.71       5.05 ± 0.51     4.71 ± 0.47
100 G'H/GB        32      73.48 ± 0.69       3.90 ± 0.49     5.31 ± 0.66
100 G'H/J         18      51.65 ± 0.58       2.46 ± 0.41     4.76 ± 0.79
100 NB/NH, R      34      48.95 ± 0.56       3.26 ± 0.40     6.66 ± 0.81
100 NB/NH, L      34      49.25 ± 0.60       3.48 ± 0.42     7.07 ± 0.86
100 NB/NH'        32      50.03 ± 0.70       3.95 ± 0.49     7.90 ± 0.99
100 O2/O1, R      31      78.15 ± 0.87       4.86 ± 0.62     6.22 ± 0.79
100 O2/O1, L      34      78.86 ± 0.90       5.27 ± 0.64     6.68 ± 0.81
100 O2/O1', R     27      81.57 ± 0.92       4.78 ± 0.65     5.86 ± 0.80
100 fmb/fml       43      82.94 ± 0.98       6.40 ± 0.69     7.72 ± 0.83
100 G2/G1         29      80.44 ± 0.91       4.91 ± 0.64     6.10 ± 0.80
P∠                26      88°.04 ± 0.65      3°.30 ± 0.46    3.75 ± 0.52
N∠                33      64°.17 ± 0.58      3°.32 ± 0.41    5.17 ± 0.64
A∠                33      74°.62 ± 0.47      2°.72 ± 0.33    3.65 ± 0.45
B∠                33      41°.21 ± 0.42      2°.40 ± 0.30    5.82 ± 0.72
θ1                26      27°.44 ± 0.71      3°.62 ± 0.50   13.19 ± 1.83
θ2                26      13°.50 ± 0.75      3°.81 ± 0.53   28.22 ± 3.91
Oc. I.            50      59.30 ± 0.35       2.50 ± 0.25     4.22 ± 0.42

* Note on Cubic Capacity. The mean cubic capacity of 1524.8 c.c. based on 25 skulls was estimated
by filling the skulls with mustard seed and measuring the amount contained in the graduated cylinder.
The mean cubic content was also estimated by transforming the weight in grams of firmly packed
mustard seed which each skull could contain into cubic centimetres, the volume of seed when tightly
packed which corresponded to 1000 grams having previously been determined by means of the crâne
étalon (Macdonell's method). The mean capacity thus found was 1495 c.c. The slight difference in the
mean estimates by the two methods is probably to be explained by the fact that owing to the fragile
condition of some of the skulls it was not possible in these to use such firm pressure in packing the
seed as in the crâne étalon. The mean capacity of the 25 skulls computed from the mean length,
breadth and height by Miss Hooke's formula is 1583 ± 9.2 c.c.




indications that the burials had not all taken place at the same time, and the 
haphazard nature of most of them was emphasised by disturbance and overlapping. 
A certain number were in trench graves, and in one of these an iron buckle was 
found in close association with a skeleton. It is of a type common in Roman and 
later times. There is nothing, however, to show that the trench graves and super- 
ficial burials belong to different periods although they probably took place at 
slightly different times. 

An interesting feature of the group is that about one-tenth of the skeletons had 
their arms crossed either behind their backs or before their chests. Such a posture
suggests that they had been bound at death and this, coupled with the casual 
nature of the burials, points to the conclusion that the site is not a formal 
cemetery. 

The Crania : Means and Variabilities. 

From the total series of approximately 100 skeletons, about 64 skulls, 52 male 
and 12 female were, after a certain degree of reconstruction, in a suitable condition 
for obtaining most of the principal measurements. The sexing of these skulls can 
be relied upon, as it was in almost all cases verified from an examination of the 
pelvic bones. The means and variabilities of the cranial characters of the male 
series are shown in Table I and the means of some of the principal characters 
of the short female series in Table II. As the variabilities shown in the length, 
breadth, height and cephalic index of the male skulls are not greater than those 
shown by the corresponding characters in the Farringdon Street series of 17th 
Century Londoners which were taken from one graveyard, or in the contemporary 
Whitechapel series collected from a single pit, there is no reason to regard the 
present series as other than fairly homogeneous. The methods of measurement of 

TABLE II.

Showing the Mean Values of some of the Principal Characters in the
Short Female Series.

Characters    No.   Means

L             12    179.1
F             12    178.3
B             12    142.9
B'            12     96.3
H'            11    126.9
OH            11    110.5
LB            11     95.1
U             11    516.2
S             11    368.3
Q'            11    312.2
100 B/L       12     79.9
100 H'/L      11     71.1
100 B/H'      11    112.7





the several cranial characters practised in the Biometric Laboratory were closely 
adhered to, and the symbols by which the characters are represented in the tables 
are those described and used in the various craniological memoirs published in 
Biometrika*, to which reference may be made. 
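The quantities following the ± signs in Table I appear to be standard errors of the usual large-sample kind: $\sigma/\sqrt{n}$ for a mean, $\sigma/\sqrt{2n}$ for a standard deviation, and $V/\sqrt{2n}$ for the coefficient of variation $V = 100\sigma/M$. That reading is an inference from the printed figures, not a statement made in the memoir; the short Python sketch below (function name hypothetical) shows how the L and C rows can be checked against it.

```python
from math import sqrt


def summary_errors(n, mean, sd):
    """Large-sample standard errors for the mean, the s.d. and V = 100 * sd / mean."""
    v = 100.0 * sd / mean
    return {
        "se_mean": sd / sqrt(n),
        "se_sd": sd / sqrt(2 * n),
        "v": v,
        "se_v": v / sqrt(2 * n),  # first-order approximation
    }


# Row L of Table I: n = 52, mean 184.92, sigma 5.84.
length = summary_errors(52, 184.92, 5.84)
# Row C of Table I: n = 25, mean 1524.80, sigma 106.59.
capacity = summary_errors(25, 1524.80, 106.59)

for name, row in (("L", length), ("C", capacity)):
    print(name, {key: round(value, 2) for key, value in row.items()})
```

Rounded to two decimals, the sketch returns 0.81, 0.57, 3.16 and 0.31 for L, and 21.32, 15.07, 6.99 and 0.99 for C, which are the figures printed for those rows.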

The Affinities of the Skulls: Coefficients of Racial Likeness. 

The archaeological evidence being very fragmentary and not at all convincing,
the first point to be determined was the racial type to which the skulls as a group
most closely conformed. For this purpose Professor Pearson's Coefficient of Racial
Likeness† was used. The available evidence seemed to suggest that the skulls
were those of Anglo-Saxons (vide Dunning and Wheeler), but a mean cephalic index
of approximately 79 for males and for females did not support the view that the
skulls conformed to the type shown by well-authenticated Anglo-Saxon skulls such
as those preserved in the London Museums which have been described by Morant‡,
the Bidford-on-Avon collection described by Brash§ or the Burwell|| collection in
the Anatomy Department at Cambridge.


TABLE III.

Coefficients of Racial Likeness between the Dunstable Group and other Groups
of Male Crania.

Dunstable (39.8).

                               Crude coefficients                            Reduced coefficients ($\bar{n}_s = \bar{n}_{s'} = 100$)
                               All characters        Indices and angles     All characters     Indices and angles

English Bronze Age (27.2)       3.86 ± .18** [29]     3.12 ± .29 [11]        11.94 ± .56**       13.16 ± 1.22
Anglo-Saxon (36.2)              5.18 ± .17 [30]       7.07 ± .28 [12]        13.66 ± .45         19.77 ± 0.78
Hythe (101.8)                   7.18 ± .17 [30]       8.80 ± .28 [12]        12.55 ± .30         16.07 ± 0.51
British Iron Age (50.8)         8.40 ± .20 [22]      13.20 ± .36 [7]         18.82 ± .46         31.37 ± 0.86
Whitechapel (92.3)             11.50 ± .18 [29]      14.76 ± .29 [11]        20.68 ± .32         28.31 ± 0.56
British Neolithic (28.9)       17.93 ± .18 [27]      26.28 ± .30 [10]        53.65 ± .54        113.65 ± 1.30

* See more especially Vols. I. pp. 416-418; III. pp. 199-207 and XIV. pp. 200-201.

† Biometrika, Vol. XVI. 1924, pp. 11-14 and Vol. XVIII. 1926, pp. 105-117. The standard deviations
used in Table III to form the C.R.L.'s are those for the long Egyptian Series E of the 26th-30th
Dynasties, provided by Davin and Pearson in Biometrika, Vol. XVI. 1924, p. 338.

‡ Ibid. Vol. XVIII. 1926, pp. 55-98.

§ Archaeologia, Vol. LXXIII. 1922-23, p. 106 (Appendix I).

|| This collection has been measured by Doris Dingwall, but the mean measurements have not yet
been published.

** Throughout this table the quantities following the ± sign are "probable" not "standard"
errors.





The male skulls were thus compared not only with an Anglo-Saxon* series but
also with the English Bronze Age series, the British Iron Age* series, the British
Neolithic* series, the Hythe crania† and the 17th Century Londoners from
Whitechapel‡. For the comparable figures for these several types we are indebted
to the authors of the memoirs containing the data relating thereto which have
been published in Biometrika. The crude coefficients of racial likeness between
the Dunstable male group and these other groups are shown in Table III. The
reduced coefficients which result from an adjustment that is made to allow for the
variation in numbers of skulls available in the different groups are also included in
the table. These coefficients are relatively comparable. As the mean numbers of
skulls available for comparison in the Bronze Age and Anglo-Saxon groups are not
very different, the crude coefficients of racial likeness between the Dunstable skulls
and these groups may in the first place be compared directly. Such a comparison
suggests that the Dunstable skulls are more closely related to the Bronze Age type
than to the Anglo-Saxon type. For shape characters (indices and angles) alone the
coefficient in the former case is 3.1 and in the latter 7.1. For all characters the
difference is not so evident as shown by coefficients of 3.9 and 5.2. When due
allowance is made for the difference in number of Bronze Age and Anglo-Saxon
skulls available for comparison, the difference in relative closeness of relationship
just described is confirmed. In general form or shape the Dunstable skulls clearly
resemble Bronze Age more nearly than Anglo-Saxon skulls. The reduced coefficient
of racial likeness for all characters between the Dunstable and Bronze Age type is
still smaller than that between the Dunstable and Anglo-Saxon types, suggesting
a closer affinity with the former, and as the difference is about 2.5 times its
probable error it may probably be considered significant. In general form the
Dunstable group is also more similar to the Hythe group than to the Anglo-Saxon
group, but, when all characters are considered, though the coefficient between the
Dunstable and Hythe groups is still smaller than that between the Dunstable and
the Anglo-Saxon, the difference is not so great that it can be considered really
significant. The Dunstable group is distinctly more divergent from the Iron Age
group than from the Anglo-Saxon group. It shows a divergence of about the same
order from the type of the Whitechapel 17th Century Londoners which, as is well
known, closely resembles that of the Iron Age. The Neolithic type is, as might be
expected, very different from that found at Dunstable.
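The adjustment behind the reduced coefficients is not written out in the memoir. A form that reproduces the printed figures, offered here as an assumption, rescales the crude coefficient by the ratio of $\bar{n}_s \bar{n}_{s'}/(\bar{n}_s + \bar{n}_{s'})$ at the standard size of 100 (that is, 50) to its value at the actual mean numbers given in brackets in Table III. The Python sketch below uses hypothetical function names and also forms the ratio of a difference to its probable error, assuming the two errors are independent and combine in quadrature.

```python
from math import sqrt


def reduced_coefficient(crude, n_a, n_b, standard=100.0):
    """Rescale a crude coefficient to mean sample sizes of `standard` in both series."""
    actual = n_a * n_b / (n_a + n_b)
    target = standard * standard / (2.0 * standard)  # 50 when standard is 100
    return crude * target / actual


def difference_in_probable_errors(x, pe_x, y, pe_y):
    """Difference of two coefficients divided by the probable error of the difference."""
    return abs(x - y) / sqrt(pe_x ** 2 + pe_y ** 2)


# Crude coefficients for all characters, with the mean numbers from Table III.
bronze = reduced_coefficient(3.86, 39.8, 27.2)
saxon = reduced_coefficient(5.18, 39.8, 36.2)
print(round(bronze, 2), round(saxon, 2))

# Reduced coefficients as printed: 11.94 ± .56 (Bronze Age), 13.66 ± .45 (Anglo-Saxon).
ratio = difference_in_probable_errors(13.66, 0.45, 11.94, 0.56)
print(round(ratio, 2))
```

With these inputs the rescaling returns values within 0.01 of the printed 11.94 and 13.66, and the ratio comes out a little under 2.5.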


Comparison of Individual Cranial Characters: Values of a. 

The characters in which the Dunstable type resembles most closely, or differs
most notably from, the Anglo-Saxon and Bronze Age types may be seen readily by
reference to Table IV in which the values of
$\alpha = \dfrac{n_s n_{s'}}{n_s + n_{s'}} \left(\dfrac{M_s - M_{s'}}{\sigma_s}\right)^2$ are tabulated.
The most pronounced differences between the Dunstable and Anglo-Saxon skull


* Biometrika, Vol. XX B. 1928, pp. 301-375.
† Ibid. Vol. XXIV. 1932, pp. 135-202.
‡ Ibid. Vol. III. 1904, pp. 191-245.





TABLE IV.

Values of $\alpha = \dfrac{n_s n_{s'}}{n_s + n_{s'}} \left(\dfrac{M_s - M_{s'}}{\sigma_s}\right)^2$ between the Dunstable and other
British Male Series.


Characters 

Anglo- 

Saxon 

English 
Bronze Age 

Hythe 

British 
Iron Age 

White- 

chapel 

British 

Neolithio 

m b/l 

57*92 

15-66 

75-20 

59-44 

100-34 

196-86 

100 H'/L 

9-33 

0-17 

16*00 

21-49 

47-21 

27-37 

100 B\R' 

8-50 

12-01 

9-05 

0*59 

0-94 

20-46 

ocA. 

2-20 

— 

0-79 

— 

— 

— 

100 O'HIOB 

2-42 

4-47 

0-00 

— 

7*30 

0-90 

100 NBjNH, It 

— 

— 

0-02 

— 

3-07 


100 NB/NW 

8-83 

1*79 

— 

13-36 

— 

18*82 

88 

— 

0-03 

0-09 

— 

0-21 

— 

1-93 

— 

— 

4-22 

— 

0-54 

100 fmb/fml 

100 of G, 

0*14 

4-31 

3-69 

— 

2-37 

0*09 

0-29 

0*60 

2-41 

__ 

0-84 

— 

Pl 

0-10 

5-77 

8-43 

— 

6-33 

7-30 

NL 

4*34 

0-09 

0-35 

0-12 

2-04 

0-44 

al 

0-7.1 

0-46 

1-63 

0-20 

2-69 

0*01 

L 

27-23 

0*12 

53-19 

5*36 

20-32 

62-13 

n 

20*88 

29-34 

2-65 

24-32 

36-60 

68-95 

B' 

3-23 

1-41 

0-02 

0-71 

1-08 

0-00 

OH 

— 

— 

0-02 

— 

20-70 

— 

27' 

0-12 

0*32 

— 

8-74 

— . 

o-oi 

LB 

2-34 

0-04 

10-32 

2-15 

2-63 

0-33 

Q 

4*18 

0-77 

1-71 

0-75 

33-85 

6*61 

u 

3-80 

11-93 

13-99 

0-23 

1-52 

16-39 

s 

5*83 

1*11 

15-96 

2-76 

2*14 

27*14 

G’H 

0-38 

3-34 

1-73 

3-34 

0*87 

0*04 

J 

13*45 

0-10 

10-97 

38-03 

40-38 

36*86 

NH, 11 

0-39 

— 

1-50 


0*70 


NH’ 

— 

1-51 

— 

0‘03 



2*51 

NB 

2-40 

0-20 

1*32 

16*20 

5-92 

13-89 

Ou R 

0-04 

4-98 

4-26 

— 

0-31 



R 

— 

— 

— 

3-37 


0-09 

O t , It 

0*16 

0-67 

1-06 

0-23 

0-00 

2*53 

0\ 

1-07 

6-16 

0-18 

- — 

14*44 



<h 

0-35 

2-55 

1*95 

— 

4-14 

0-42 

fml 

1-48 

0-83 

6*65 

1-19 

3*57 

0-33 

fmb 

1 -35 

M2 

0*27 

— 

0-07 

0-00 


types are shown in the maximum length and maximum breadth and the indices
involving these absolute measurements, namely, 100 × B/L, 100 × B/H' and
100 × H'/L. The Anglo-Saxon skull is on the average about 5.5 mm. longer and
about 3.5 mm. narrower at its broadest part than the Dunstable skull although of
about equal height. It has also a longer sagittal arc, a narrower face (bizygomatic
diameter) and a relatively narrower nasal aperture. The Bronze Age skull is on
the average 4.5 mm. broader than the Dunstable skull, but is practically equivalent
to it in mean length and mean height. The indices involving the maximum
breadth, 100 × B/L and 100 × B/H', are thus higher on the average in the Bronze


Age than in the Dunstable specimens. The Bronze Age skull has also a larger
horizontal circumference, a longer palate, a wider orbit and a smaller profile angle
(∠ P); the mean sagittal arc and the mean bizygomatic breadth, however,
correspond closely in the two types.
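The link between the $\alpha$'s of Table IV and the coefficients of Table III can be illustrated as follows. The sketch assumes the usual definition of the crude Coefficient of Racial Likeness, the mean of the $\alpha$'s over the $M$ characters less 1, with probable error $0.67449\sqrt{2/M}$; neither formula is restated in this memoir. The eleven $\alpha$'s are the indices-and-angles entries of the English Bronze Age column as far as the flattened table can be read, and the assignment of three of them (0.03, 4.31 and 0.60) to that column is itself an assumption. Function names are hypothetical.

```python
from math import sqrt


def alpha(n_a, n_b, mean_a, mean_b, sigma):
    """alpha for one character: n_a n_b / (n_a + n_b) * ((M_a - M_b) / sigma)^2."""
    return n_a * n_b / (n_a + n_b) * ((mean_a - mean_b) / sigma) ** 2


def crude_crl(alphas):
    """Crude coefficient (mean alpha minus 1) and its probable error."""
    m = len(alphas)
    return sum(alphas) / m - 1.0, 0.67449 * sqrt(2.0 / m)


# Indices and angles, Dunstable against English Bronze Age (read from Table IV).
bronze_age_shape = [15.66, 0.17, 12.01, 4.47, 1.79, 0.03, 4.31, 0.60, 5.77, 0.09, 0.46]
coefficient, probable_error = crude_crl(bronze_age_shape)
print(round(coefficient, 2), round(probable_error, 2))

# A purely hypothetical single character: two series of 50 skulls whose means
# differ by one standard deviation give alpha = 25.
print(alpha(50, 50, 10.0, 12.0, 2.0))
```

On this reading the eleven values give a coefficient within 0.01 of the 3.12 ± .29 printed for indices and angles in Table III.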

The Dunstable skull differs from the Hythe skull principally in maximum
length. On the average it is about 7 mm. longer. The indices involving length,
100 × B/L and 100 × H'/L, are thus higher in the Hythe skull. The Hythe skull
has also a shorter cranio-facial base (basi-nasal length), a shorter sagittal arc,
a less extensive horizontal circumference, a smaller profile angle (∠ P), a narrower
face (bizygomatic breadth), and a shorter foramen magnum.

The Female Skulls. 

As there are only 12 female skulls, any detailed comparison of the average 
measurements of this group with those of other types is not justifiable. So far as 
an average based upon such a small number of specimens can be relied upon, 
however, there is a strong suggestion that the female skull is definitely broader 
than the female Anglo-Saxon skulls described by Morant. Its mean maximum 
breadth is 142.9 mm. as compared with 135.6 mm. in the female Anglo-Saxon. As
the mean lengths in the two groups are equivalent, the Dunstable skull is thus
relatively broader than the Anglo-Saxon; the cephalic index is 79.9 as compared
with 74.4. The mean 100 × B/H' index is in the Dunstable skulls 112.7 as
compared with 105.7 in the Anglo-Saxons in the London Museums. Comparison
of these characters and of the mean measurements of others in the two groups 
seems to indicate that the female skulls conform generally in type to the male 
skulls and differ considerably from the female Anglo-Saxons. 

Discussion. 

It has been suggested by Dunning and Wheeler* that the skeletal remains 
found in the most northerly knoll at Dunstable are probably those of a Saxon raiding
party which was exterminated in the 6th century a.d. by the local inhabitants. The 
detailed analysis of the cranial measurements of the group and the survey of its cranial 
affinities with other racial types, including Anglo-Saxon, Bronze Age and Hythe crania, 
would appear to suggest that the skulls cannot be considered those of Anglo-Saxons. 
It might be regarded as more probable, if the dating of the burials can be accepted 
as approximately correct, that the skulls are those of a colony of the local
inhabitants who were summarily disposed of possibly by a raiding party of Anglo- 
Saxons. This view receives support from the circumstance that almost 20 per cent, 
of the more or less complete skulls that could be measured are those of women and 
that these female skulls conform in their general shape to the male skulls. 

It is not improbable that a definite broad-headed element derived largely from 
the Bronze Age population may have persisted in certain localities such as 
Dunstable. On the other hand, the existence of a “moderate” degree of association 


* loc. cit.




between the Dunstable and the Hythe crania is of interest in view of Stoessiger 
and Morant's* conclusion that the latter are probably for the most part the
descendants of a Roman colony with a large central European element. The Five Knolls
at Dunstable are in the vicinity of Watling Street and the Icknield Way, and it is 
not impossible that the broad-headed tendency which is a feature of the relatively 
homogeneous Dunstable series may be evidence of a similar European element in 
the inhabitants, introduced at the time of, and persisting to a notable degree after, 
the Roman occupation. The various objects of Roman date found with the skeletal 
remains at least provide distinct evidence of contact with this people. 

The view that the skulls represent a survival of Roman invaders is, perhaps, 
on the whole more probable than that they indicate the persistence of the Bronze 
Age population. The C.R.L. between the Dunstable skulls and an Etruscan series
has been found by Dr Morant to be lower than that between the Dunstable and 
Bronze Age groups, and he thinks it not unlikely that the Pompeians and possibly 
a few other European series would also resemble the Dunstable skulls more closely 
than the Bronze Age type does. 

While some reasonable doubt may still be held as to the origin of the skulls, 
all the evidence that can be derived from the crania themselves seems to indicate 
quite unequivocally that they are not those of Anglo-Saxons of a recognised type. 

Measurements of the Mandibles.

Comparatively little is yet known about the detailed comparison of measure- 
ments of the mandibular characters in adequate series of specimens belonging to 
different types of skulls. Now that a comprehensive scheme for the measurement 
of this bone has been devised and described by Morant† it is probable that suitable
and adequate data for the comparison of different types will soon be available. The
records of mean values of measurements of numerous characters, using the technique
described by this author for series of mandibles associated with Anglo-Saxon‡,
17th Century London§, Badari Egyptian||, Tibetan A¶, Nepalese¶, Tamil**,
Fukien**, and Hylam** skulls which have already been published in memoirs
in Biometrika, indicate that definite progress is being made in the provision of 
material for a proper study of the racial characters of an important part of the 
skull which has hitherto, largely through unavoidable circumstances, received but 
little attention. 

As there were approximately 40 more or less complete mandibles associated 
with the relatively homogeneous series of male skulls in the Dunstable collection, it 
seemed desirable that the mean measurements of the principal mandibular characters 

* Biometrika, Vol. XXIV. 1932, pp. 135-202.

† Ibid. Vol. XIV. 1923, pp. 253-260.

‡ Ibid. Vol. XVIII. 1926, pp. 55-98 (Table XVIII).

§ Ibid. Vol. XVIII. 1926, pp. 1-55 (Appendix C).

|| Ibid. Vol. XIX. 1927, p. 149.

¶ Ibid. Vol. XVI. 1924, pp. 103-104.

** Ibid. Vol. XX B. 1928, pp. 279-298.





Biometrika, Vol. XXV, Parts I and II
D. Dingwall and M. Young: Skulls from Dunstable

[Plate, number and caption not recovered.]
[Plate II. Typical Male Skull, No. 18. Norma lateralis. Scale illegible.]
[Plate III. Typical Male Skull, No. 18. Norma verticalis. Life size.]
[Plate IV. Typical Male Skull, No. 18. Norma occipitalis. Life size.]
[Plate V. Caption not recovered.]
[Plate VI. Caption not recovered.]




should be placed on record. Only male mandibles have been dealt with. The 
measurements taken were those listed by Morant in his memoir in Biometrika , 
Vol. XIV. 1923, and the technique of measurement which he describes therein was 
carefully followed. The symbols which are used to represent the various characters 
are those introduced by Morant and used in all the memoirs to which reference 
has been made. Although the number of mandibles was only 42, defects in some of 
which occasionally precluded the measurement of particular characters, so that the 
number is not quite so large as the number of Anglo-Saxon male mandibles 
measured by Morant, it was deemed advisable to calculate the variabilities of 
the various characters in order to provide provisional estimates of these. The 
estimate of the variability of a mandibular character, which is based on a number of 
observations that may possibly be no more than adequate to provide a reliable 
criterion, is better than no estimate at all of the variability. So far, no attempt 
has apparently been made to furnish, or at least to publish, even approximate 
measures of the variabilities of the several mandibular characters. 

The means and variabilities of the mandibular characters are shown in Table V 
in apposition to the means of the corresponding characters in series of mandibles 
belonging to Anglo-Saxon, Badari Egyptian, Tibetan A and Fukien male skulls. 
The last-mentioned Asiatic group was selected for comparison in preference to the 
Tamil or Hylam mandibles, for which mean measurements have also been published 
by Harrower, because it possesses, according to this author, a heavier and more solid 
type of mandible than either of these two racial series. 

From a comparison of the data in Table V, it is evident that there is a very 
close resemblance between the Dunstable and Anglo-Saxon mandibles. The mean 
values of many of the corresponding characters in these two types are almost 
identical, and few of the remaining characters diverge in average value to such a 
degree that the differences can be considered real or significant on such numbers of 
observations as are available. It may readily be seen that the mandibles of the 
other skull types in the table are, as might be expected, less like the Dunstable 
type. 

Unfortunately, there is no other male series of British mandibles with which 
the Dunstable type can be compared. Miss Hooke* has published, however, the 
individual measurements of the characters in a series of about 60 unsexed 
mandibles belonging to the Farringdon Street 17th Century Londoners. For the 
indices and angles — the characters which describe the relative proportions and 
general form of the mandible — in this unsexed series, the means and variabilities 
have been calculated. These are shown in Table VI in comparison with the 
corresponding constants for the Dunstable male series and with the means 
computed for Morant’s Anglo-Saxon series by taking males and females together. 
Morant has stated that in his series the male and female indices and angles are so 
similar that, for the small numbers dealt with, the differences would almost 
certainly not be significant. From a brief scrutiny of Table VI it is evident that 

* Biometrika, Vol. XVIII. 1926, pp. 1-55 (Appendix C).





Skulls from Excavations at Dunstable 


TABLE V.

Showing a Comparison of the Mean Values of the Mandibular Characters
in the Dunstable and other series of Male Skulls.

                        Dunstable                              Anglo-Saxon    Tibetan A      Badari Egyptian  Fukien Chinese
Characters              Means             S.D.            No.  Means     No.  Means     No.  Means     No.    Means     No.

w1                      120.99 ± 1.02*    5.47 ± 0.72*    29   123.7     25   117.0     25   109.5     30     121.9     38
[illegible]             1052.93 ± 1.06    6.63 ± 0.74     40   103.2     45   96.2      25   88.8      32     101.0     38
K                       32.51 ± 0.39      2.46 ± 0.28     39   33.1      40   30.6      25   32.6      34     35.2      38
zz                      45.43 ± 0.37      2.38 ± 0.26     42   45.3      57   45.7      25   43.4      36     46.8      38
c_r c_r                 99.34 ± 1.14      5.92 ± 0.81     27   100.3     27   93.8      25   86.8      29     97.2      38
rb                      36.49 ± 0.51      3.29 ± 0.36     42   36.4      58   37.2      25   36.7      34     39.4      38
rb'                     32.75 ± 0.45      2.93 ± 0.32     42   33.2      61   32.1      25   33.6      39     34.4      38
g:                      47.58 ± 0.36      2.30 ± 0.25     41   48.7      43   46.7      25   41.1      29     43.9      38
c u c r                 36.70 ± 0.63      3.86 ± 0.45     37   33.9      40   34.2      25   33.8      37     35.4      38
g_o g_o                 98.40 ± 1.10      6.95 ± 0.78     40   100.4     33   92.8      24   83.9      31     95.4      38
g_n g_o (L)             84.96 ± 0.60      3.82 ± 0.43     40   87.9      38   83.8      24   82.0      31     83.1      38
g_n g_o (R)             84.92 ± 0.67      4.26 ± 0.48     40   89.9      41   83.8      24   82.4      31     82.3      38
c_y l                   21.31 ± 0.27      1.59 ± 0.19     35   21.7      38   18.8      25   20.3      36     20.1      38
c_y b                   9.01 ± 0.26       1.67 ± 0.19     36   9.5       42   8.1       25   9.8       37     8.9       38
m iPl                   27.99 ± 0.21      1.34 ± 0.15     39   28.1      59   29.2      24   27.3      33     30.2      33
pA                      28.28 ± 0.42      2.61 ± 0.30     39   28.2      38   25.4      25   26.7      32     30.0      38
Puffn                   7.25 ± 0.27       1.77 ± 0.19     42   7.1       59   8.2       25   8.5       36     7.5       38
pAi                     25.02 ± 0.50      2.94 ± 0.35     35   25.3      41   23.0      24   23.4      31     27.5      38
[illegible]             29.57 ± 0.41      2.45 ± 0.29     35   30.0      41   28.3      25   29.6      31     32.9      38
Pafb                    4.55 ± 0.41       2.16 ± 0.29     28   3.1       6    3.6       25   —         —      5.3       38
9»P«9o                  193.40 ± 1.46     9.14 ± 1.02     40   198.8     39   190.4     24   189.5     31     189.8     38
Hi                      48.41 ± 0.83      5.07 ± 0.59     37   47.9      51   41.8      25   44.3      35     47.2      38
ih'                     13.39 ± 0.27      1.60 ± 0.19     34   13.6      35   15.0      25   12.2      33     13.6      38
c_r h                   65.00 ± 0.88      5.41 ± 0.62     38   65.7      48   60.8      25   61        33     65.9      38
c_y h                   58.55 ± 0.92      5.93 ± 0.65     42   59.4      44   50.8      25   53.8      35     56.8      38
a t h                   36.50 ± 0.37      2.28 ± 0.26     39   86.3      40   35.0      24   36.1      31     39.7      38
nit h                   26.60 ± 0.35      2.25 ± 0.25     41   27.2      51   24.0      25   24.5      31     28.6      34
vA                      30.66 ± 0.37      2.37 ± 0.26     40   30.9      54   28.9      25   30.4      30     32.8      38
c_p l                   76.16 ± 0.79      4.90 ± 0.56     38   77.7      42   74.6      25   76.2      33     73.8      38
rl                      64.69 ± 0.70      4.29 ± 0.49     38   64.0      45   58.4      25   67.6      33     61.5      38
ml                      106.76 ± 0.80     4.64 ± 0.56     34   107.2     31   105.2     25   101.2     33     102.8     38
100 c_r h/ml            59.27 ± 1.01      5.65 ± 0.72     31   60.9      27   57.8      25   61.0      32     64.0      38
100 c_r c_r/ml          93.48 ± 1.11      5.57 ± 0.79     25   94.4      15   89.2      25   86.2      29     94.7      38
100 g_o g_o/c_p l       129.67 ± 2.12     13.05 ± 1.50    38   129.0     32   124.9     24   110.4     30     129.9     38
100 rb'/rl              49.55 ± 0.84      5.16 ± 0.59     38   51.5      45   55.3      25   58.5      33     56.0      38
100 c_y b/c_y l         42.33 ± 0.99      5.86 ± 0.70     35   44.0      38   43.3      25   48.4      36     44.5      38
100 g_o g_o/c_r c_r     100.01 ± 1.85     9.62 ± 1.31     27   99.3      19   98.9      24   97.4      28     98.1      38
100 c_y h/c_r h         89.97 ± 1.01      6.21 ± 0.71     38   89.9      40   83.8      25   86.9      32     86.0      38
100 m'/c_y c_y          37.65 ± 0.82      4.77 ± 0.58     34   40.4      35   44.0      25   36.9      32     41.3      38
100 a_t h/c_r h         55.34 ± 0.84      5.02 ± 0.59     36   55.2      30   57.9      24   58.6      29     60.4      38
M∠                      120°.88 ± 0.95    5°.94 ± 0.67    39   120°.3    47   125°.3    25   120°.0    34     121°.0    38
R∠                      68°.70 ± 0.98     5°.95 ± 0.69    37   72°.0     36   70°.5     25   73°.7     31     74°.7     38
Q∠                      70°.86 ± 0.83     5°.26 ± 0.59    40   68°.6     31   67°.1     24   60°.8     31     70°.5     38
C∠                      68°.11 ± 0.99     6°.05 ± 0.70    37   68°.2     32   64°.8     25   71°.7     28     —         —
C∠                      69°.04 ± 0.93     5°.60 ± 0.66    36   68°.3     35   62°.9     24   73°.2     28     77°.1     38
F∠                      88°.40 ± 0.81     4°.31 ± 0.58    28   87°.0     6    90°.6     25   —         —      93°.0     38

* Standard errors.



Doris Dingwall and Matthew Young 




TABLE VI.

Showing a Comparison of the Mandibular Indices and Angles in the Dunstable Skulls
with those in Anglo-Saxons and 17th Century Londoners.

                        Dunstable (M.)                         Anglo-Saxon      Farringdon Street (M. and F.)
                                                               (M. and F.)
Characters              Means             σ               No.  Means     No.    Means             σ               No.

100 c_r h/ml            59.27 ± 1.01*     5.65 ± 0.72*    31   69.4      65     58.59 ± 0.78*     6.06 ± 0.66*    61
100 c_r c_r/ml          93.48 ± 1.11      5.57 ± 0.79     25   92.7      41     90.43 ± 0.94      6.60 ± 0.66     49
100 g_o g_o/c_p l       129.67 ± 2.12     13.05 ± 1.50    38   127.6     67     126.35 ± 1.56     12.21 ± 1.10    61
100 rb'/rl              49.55 ± 0.84      5.16 ± 0.59     38   52.2      88     51.57 ± 0.78      6.21 ± 0.55     63
100 c_y b/c_y l         42.33 ± 0.99      5.86 ± 0.70     35   43.9      74     48.64 ± 0.94      7.35 ± 0.66     61
100 g_o g_o/c_r c_r     100.01 ± 1.85     9.62 ± 1.31     27   99.3      42     99.87 ± 1.25      8.46 ± 0.88     46
100 c_y h/c_r h         89.97 ± 1.01      6.21 ± 0.71     38   89.8      80     87.71 ± 0.99      7.80 ± 0.70     62
100 m'/c_y c_y          37.65 ± 0.82      4.77 ± 0.58     34   39.7      70     40.64 ± 0.80      6.14 ± 0.57     59
100 a_t h/c_r h         55.34 ± 0.84      5.02 ± 0.59     36   55.7      55     54.10 ± 0.86      6.75 ± 0.61     61
M∠                      120°.88 ± 0.95    5°.94 ± 0.67    39   121°.4    96     124°.11 ± 0.88    6°.99 ± 0.62    63
R∠                      68°.70 ± 0.98     5°.95 ± 0.69    37   70°.1     72     70°.22 ± 1.00     7°.65 ± 0.71    59
Q∠                      70°.86 ± 0.83     5°.26 ± 0.59    40   68°.5     65     67°.87 ± 0.73     5°.70 ± 0.52    61
C∠                      68°.11 ± 0.99     6°.05 ± 0.70    37   69°.0     57     64°.42 ± 0.90     7°.05 ± 0.64    62
C∠                      69°.04 ± 0.93     5°.60 ± 0.66    36   69°.4     67     66°.49 ± 0.85     6°.84 ± 0.60    65

* Standard errors.


the Dunstable mandible in its general form resembles the Anglo-Saxon mandible 
more closely than it does that of the 17th Century Londoner. In seven of the 
fourteen characters compared, even though the number of specimens available is so 
relatively small, the Dunstable mandible probably differs significantly from the 
Londoner, whereas the mean values of the characters in the Anglo-Saxon mandible 
are so similar to those in the Dunstable mandible that probably only one or two of 
the differences observed can be considered real. The fact that the Dunstable 
mandible resembles the Anglo-Saxon mandible more closely than it does that of 
the 17th Century Londoner from Farringdon Street in no way controverts the 
conclusion that the Dunstable skull is more closely related to the Bronze Age 
type than to the Anglo-Saxon, as it has been shown that the Dunstable cranium 
is more closely related to the Anglo-Saxon cranium than to that of the 17th Century 
Londoner from Whitechapel, which is very similar to the contemporary series 
from Farringdon Street. 
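
The comparison just described can be sketched numerically. The authors do not state the criterion by which a difference was judged "probably significant"; the sketch below assumes, purely for illustration, that a difference of means is flagged when it exceeds twice the standard error of the difference (the root of the sum of the squared standard errors). The figures are the Dunstable and Farringdon Street means and standard errors as they stand in Table VI; the row names are placeholders, not the authors' symbols.

```python
import math

# (row, Dunstable mean, its S.E., Farringdon Street mean, its S.E.),
# transcribed in order from Table VI; "index n"/"angle n" are placeholder names.
TABLE_VI = [
    ("index 1", 59.27, 1.01, 58.59, 0.78),
    ("index 2", 93.48, 1.11, 90.43, 0.94),
    ("index 3", 129.67, 2.12, 126.35, 1.56),
    ("index 4", 49.55, 0.84, 51.57, 0.78),
    ("index 5", 42.33, 0.99, 48.64, 0.94),
    ("index 6", 100.01, 1.85, 99.87, 1.25),
    ("index 7", 89.97, 1.01, 87.71, 0.99),
    ("index 8", 37.65, 0.82, 40.64, 0.80),
    ("index 9", 55.34, 0.84, 54.10, 0.86),
    ("angle 1", 120.88, 0.95, 124.11, 0.88),
    ("angle 2", 68.70, 0.98, 70.22, 1.00),
    ("angle 3", 70.86, 0.83, 67.87, 0.73),
    ("angle 4", 68.11, 0.99, 64.42, 0.90),
    ("angle 5", 69.04, 0.93, 66.49, 0.85),
]


def ratio(m1, se1, m2, se2):
    """Absolute difference of two means over the S.E. of that difference."""
    return abs(m1 - m2) / math.hypot(se1, se2)


def flagged(rows, threshold=2.0):
    """Rows whose ratio exceeds the (assumed) threshold."""
    return [name for name, m1, s1, m2, s2 in rows
            if ratio(m1, s1, m2, s2) > threshold]


if __name__ == "__main__":
    for name, m1, s1, m2, s2 in TABLE_VI:
        print(f"{name:8s} ratio = {ratio(m1, s1, m2, s2):.2f}")
    print("flagged:", flagged(TABLE_VI))
```

Under this assumed threshold seven of the fourteen rows are flagged, the same count as is quoted in the text, though that agreement does not show that this was the rule actually applied.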

The number of fairly complete mandibles* available in the Dunstable series is 
obviously inadequate to justify any attempt to calculate the correlation between 
the various mandibular characters. 





ON THE APPLICATIONS OF THE DOUBLE BESSEL 
FUNCTION $\mathfrak{K}_{\tau_1,\tau_2}(x)$ TO STATISTICAL PROBLEMS. 

By KARL PEARSON. 


Part I. Theoretical. 


(i) I left over from the discussion in Biometrika, Vol. XXIV. pp. 293-343 the 
case in which two variables, u and v, follow Type III curves of the form 

$$\frac{M\gamma_1^{\tau_1+1}}{\Gamma(\tau_1+1)}\,e^{-\gamma_1 u}u^{\tau_1} \quad\text{and}\quad \frac{M\gamma_2^{\tau_2+1}}{\Gamma(\tau_2+1)}\,e^{-\gamma_2 v}v^{\tau_2}, \tag{i}$$

leading to the surface 

$$w = \frac{M\gamma_1^{\tau_1+1}\gamma_2^{\tau_2+1}}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)}\,e^{-(\gamma_1 u+\gamma_2 v)}u^{\tau_1}v^{\tau_2}, \tag{ii}$$

where $\gamma_1$ and $\gamma_2$ and $\tau_1$ and $\tau_2$ are not equal each to each. We require to discuss 
the distribution surface of $Y = v - u$. 

Let $X = v + u$, and we have 

$$w = \frac{M\gamma_1^{\tau_1+1}\gamma_2^{\tau_2+1}}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)\,2^{\tau_1+\tau_2+1}}\,e^{-\frac12(\gamma_1+\gamma_2)X-\frac12(\gamma_2-\gamma_1)Y}(X-Y)^{\tau_1}(X+Y)^{\tau_2}. \tag{iii}$$

Let us write 

$$z_0 = \frac{M\gamma_1^{\tau_1+1}\gamma_2^{\tau_2+1}}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)\,2^{\tau_1+\tau_2+1}}. \tag{iv}$$

Now we want to integrate w from $X = Y$ to $X = \infty$; let us put $X = Yt$ and 
integrate t from 1 to $\infty$. 

Hence the distribution curve for Y is 

$$z = z_0\,e^{-\frac12(\gamma_2-\gamma_1)Y}\,Y^{\tau_1+\tau_2+1}\int_1^\infty e^{-\frac12(\gamma_1+\gamma_2)Yt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt. \tag{v}$$

This curve is closely allied to the Bessel Functions of the second kind with 
imaginary argument. 

Let us consider the expression 

$$\phi(Y) = Y^{\tau_1+\tau_2+1}\int_1^\infty e^{-\frac12(\gamma_1+\gamma_2)Yt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt, \tag{vi}$$

then if we had started below the OX line (see Diagram Fig. 1, p. 295 of the paper 
referred to above), we should have had to integrate X from $-Y$ to $\infty$, or t from $-1$ 
to $\infty$; then by changing t to $-t$, we obtain precisely the expression (vi) with $\tau_1$ and $\tau_2$ 
interchanged. Thus the only thing which remains to be changed when we put 
Y negative is the term $e^{-\frac12(\gamma_2-\gamma_1)Y}$, which we accomplish by interchanging $\gamma_1$ and $\gamma_2$. 

Thus the equation to our curve is 

$$z = z_0\,e^{-\frac12(\gamma_2-\gamma_1)Y}\,\phi(|Y|), \tag{vii}$$

where Y is to be taken from $-\infty$ to $+\infty$, but $\tau_1$ and $\tau_2$ are to be interchanged 
when Y is negative in $\phi$. 

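(ii) I start with the integral 

$$W = \int_1^\infty e^{-c_1Y(t-1)-c_2Y(t+1)}\,dt. \tag{viii}$$

Then 

$$\frac{d^{\tau_1+\tau_2}W}{dc_1^{\tau_1}dc_2^{\tau_2}} = (-1)^{\tau_1+\tau_2}\int_1^\infty e^{-c_1Y(t-1)-c_2Y(t+1)}\,Y^{\tau_1+\tau_2}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt$$

$$= (-1)^{\tau_1+\tau_2}\,Y^{\tau_1+\tau_2}e^{(c_1-c_2)Y}\int_1^\infty e^{-(c_1+c_2)Yt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt. \tag{ix}$$

Returning to (viii), 

$$W = e^{(c_1-c_2)Y}\int_1^\infty e^{-(c_1+c_2)Yt}\,dt,$$

or, $c_1$ and $c_2$ being positive, 

$$W = e^{(c_1-c_2)Y}\left[-\frac{e^{-(c_1+c_2)Yt}}{(c_1+c_2)Y}\right]_1^\infty = \frac{e^{-2c_2Y}}{(c_1+c_2)Y}. \tag{x}$$

Accordingly 

$$\frac{d^{\tau_1}W}{dc_1^{\tau_1}} = \frac{(-1)^{\tau_1}\Gamma(\tau_1+1)\,e^{-2c_2Y}}{Y(c_1+c_2)^{\tau_1+1}}$$

and 

$$\frac{d^{\tau_1+\tau_2}W}{dc_1^{\tau_1}dc_2^{\tau_2}} = \frac{(-1)^{\tau_1}\Gamma(\tau_1+1)}{Y}\,\frac{d^{\tau_2}}{dc_2^{\tau_2}}\left(\frac{e^{-2c_2Y}}{(c_1+c_2)^{\tau_1+1}}\right).$$

Applying Leibnitz's Theorem 

$$\frac{d^{\tau_1+\tau_2}W}{dc_1^{\tau_1}dc_2^{\tau_2}} = \frac{(-1)^{\tau_1}\Gamma(\tau_1+1)\,e^{-2c_2Y}}{Y}\left[\frac{(-2Y)^{\tau_2}}{(c_1+c_2)^{\tau_1+1}} + \tau_2\,\frac{(-2Y)^{\tau_2-1}\{-(\tau_1+1)\}}{(c_1+c_2)^{\tau_1+2}} + \frac{\tau_2(\tau_2-1)}{1\cdot2}\,\frac{(-2Y)^{\tau_2-2}\{-(\tau_1+1)\}\{-(\tau_1+2)\}}{(c_1+c_2)^{\tau_1+3}} + \dots\right]$$

$$= \frac{(-1)^{\tau_1+\tau_2}\,2^{\tau_2}Y^{\tau_2-1}e^{-2c_2Y}\,\Gamma(\tau_1+1)}{(c_1+c_2)^{\tau_1+1}}\left[1 + \frac{\tau_2(\tau_1+1)}{(c_1+c_2)\,2Y} + \frac{\tau_2(\tau_2-1)(\tau_1+1)(\tau_1+2)}{1\cdot2\,(c_1+c_2)^2(2Y)^2} + \dots\right]. \tag{xi}$$

Now put $c_1 = \frac12\gamma_1$, $c_2 = \frac12\gamma_2$, and we have from (vi) and (ix) 

$$\phi(|Y|) = Y^{\tau_1+\tau_2+1}\int_1^\infty e^{-\frac12(\gamma_1+\gamma_2)Yt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt = (-1)^{\tau_1+\tau_2}\,Y e^{-(c_1-c_2)Y}\,\frac{d^{\tau_1+\tau_2}W}{dc_1^{\tau_1}dc_2^{\tau_2}}$$

$$= \frac{2^{\tau_2}Y^{\tau_2}e^{-\frac12(\gamma_1+\gamma_2)Y}\,\Gamma(\tau_1+1)}{\{\frac12(\gamma_1+\gamma_2)\}^{\tau_1+1}}\left[1 + \frac{\tau_2(\tau_1+1)}{(\gamma_1+\gamma_2)Y} + \frac{\tau_2(\tau_2-1)(\tau_1+1)(\tau_1+2)}{1\cdot2\,(\gamma_1+\gamma_2)^2Y^2} + \dots\right].$$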

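Hence by (vii) and (iv), for $Y = v - u$ positive, 

$$z = \frac{M\gamma_1^{\tau_1+1}\gamma_2^{\tau_2+1}\,e^{-\gamma_2Y}Y^{\tau_2}}{\Gamma(\tau_2+1)(\gamma_1+\gamma_2)^{\tau_1+1}}\left[1 + \frac{\tau_2(\tau_1+1)}{1!\,(\gamma_1+\gamma_2)Y} + \frac{\tau_2(\tau_2-1)(\tau_1+1)(\tau_1+2)}{2!\,(\gamma_1+\gamma_2)^2Y^2} + \dots + \frac{\tau_2!\,(\tau_1+\tau_2)!}{\tau_2!\,\tau_1!\,(\gamma_1+\gamma_2)^{\tau_2}Y^{\tau_2}}\right]. \tag{xii}$$

The last term if the series be finite, i.e. $\tau_2$ an integer, will be as above. 

It follows that for $Y = 0$ we have 

$$z_{Y=0} = \frac{M\gamma_1^{\tau_1+1}\gamma_2^{\tau_2+1}}{(\gamma_1+\gamma_2)^{\tau_1+\tau_2+1}}\cdot\frac{\Gamma(\tau_1+\tau_2+1)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)}. \tag{xiii}$$

This expression is symmetrical in $\tau_1$, $\tau_2$, $\gamma_1$ and $\gamma_2$. We can take for the 
$u - v$ side of the curve 

$$z = \frac{M\gamma_1^{\tau_1+1}\gamma_2^{\tau_2+1}\,e^{-\gamma_1Y}Y^{\tau_1}}{\Gamma(\tau_1+1)(\gamma_1+\gamma_2)^{\tau_2+1}}\left[1 + \frac{\tau_1(\tau_2+1)}{1!\,(\gamma_1+\gamma_2)Y} + \frac{\tau_1(\tau_1-1)(\tau_2+1)(\tau_2+2)}{2!\,(\gamma_1+\gamma_2)^2Y^2} + \dots + \frac{\tau_1!\,(\tau_1+\tau_2)!}{\tau_1!\,\tau_2!\,(\gamma_1+\gamma_2)^{\tau_1}Y^{\tau_1}}\right], \tag{xiv}$$

leading to the same value of $z_{Y=0}$. 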
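
As a numerical illustration of the passage from the integral form (v) to the finite series (xii), the following sketch evaluates both for integer $\tau_2$ and compares them. It is only a check under stated assumptions: the function names, the Simpson quadrature, the truncation of the infinite range, and the trial values of the constants are all choices made here, not part of the paper.

```python
import math


def simpson(f, a, b, n=20000):
    """Composite Simpson rule on [a, b] using n (even) sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def z_integral(Y, t1, t2, g1, g2, M=1.0):
    """Ordinate for Y > 0 from the integral form (v)."""
    z0 = (M * g1 ** (t1 + 1) * g2 ** (t2 + 1)
          / (math.gamma(t1 + 1) * math.gamma(t2 + 1) * 2 ** (t1 + t2 + 1)))
    a = 0.5 * (g1 + g2) * Y
    # Substitute s = t - 1; beyond s = 60 / a the integrand is negligible.
    integral = simpson(
        lambda s: math.exp(-a * (s + 1)) * s ** t1 * (s + 2) ** t2,
        0.0, 60.0 / a)
    return z0 * math.exp(-0.5 * (g2 - g1) * Y) * Y ** (t1 + t2 + 1) * integral


def z_series(Y, t1, t2, g1, g2, M=1.0):
    """Ordinate for Y > 0 from the finite series (xii); t2 a non-negative integer."""
    lead = (M * g1 ** (t1 + 1) * g2 ** (t2 + 1) * math.exp(-g2 * Y) * Y ** t2
            / (math.gamma(t2 + 1) * (g1 + g2) ** (t1 + 1)))
    total, term = 0.0, 1.0
    for k in range(t2 + 1):
        total += term
        # Ratio of successive terms: (t2 - k)(t1 + 1 + k) / ((k + 1)(g1 + g2) Y).
        term *= (t2 - k) * (t1 + 1 + k) / ((k + 1) * (g1 + g2) * Y)
    return lead * total


if __name__ == "__main__":
    args = (0.8, 2, 3, 1.5, 2.0)  # Y, tau1, tau2, gamma1, gamma2 (trial values)
    print(z_integral(*args), z_series(*args))
```

With $\tau_1 = \tau_2 = 0$ the series reduces to $\gamma_1\gamma_2 e^{-\gamma_2Y}/(\gamma_1+\gamma_2)$, the familiar ordinate for the difference of two exponential variates, which gives a second, independent check.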

(iii) In order to discuss the moments of the curve in (v) we require the sum 
of the first p terms of the negative binomial $(1-x)^{-m}$. The following elegant proof 
was provided for me by Mr E. C. Fieller. 

The remainder after p terms of the Maclaurin series in the integral form for 
a function $f(x)$ is 

$$R = \frac{1}{\Gamma(p)}\int_0^x f^{(p)}(t)\,(x-t)^{p-1}\,dt.$$

But for the binomial $f(x) = (1-x)^{-m}$, and accordingly 

$$f^{(p)}(x) = (-1)^p\{-m(-m-1)(-m-2)\dots(-m-p+1)\}(1-x)^{-m-p} = \frac{\Gamma(m+p)}{\Gamma(m)}(1-x)^{-m-p},$$

therefore 

$$R = \frac{\Gamma(m+p)}{\Gamma(m)\Gamma(p)}\int_0^x (1-t)^{-m-p}(x-t)^{p-1}\,dt.$$

Take $1 - u = (1-x)/(1-t)$; then $t = 0$, $u = x$, and $t = x$, $u = 0$, $dt = -\dfrac{1-x}{(1-u)^2}\,du$, and we have 

$$R = \frac{\Gamma(m+p)}{\Gamma(m)\Gamma(p)}\,(1-x)^{-m}\int_0^x u^{p-1}(1-u)^{m-1}\,du = (1-x)^{-m}\times I_x(p,m),$$

where $I_x(p,m)$ is the ratio of the incomplete to the complete beta-function. 
Accordingly the sum of the first p terms of $(1-x)^{-m}$ 

$$= (1-x)^{-m} - R = (1-x)^{-m}\{1 - I_x(p,m)\} = (1-x)^{-m}\,I_{1-x}(m,p), \tag{xv}$$

which is the result we require. 
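
The Lemma (xv) is easily checked numerically. The sketch below sums the first p terms of the negative binomial directly and compares them with $(1-x)^{-m}I_{1-x}(m,p)$, the incomplete beta-function ratio being obtained here by a plain Simpson quadrature. The function names and the quadrature are assumptions of this illustration (adequate only when both arguments of the beta-function are not less than unity), not anything given in the paper.

```python
import math


def simpson(f, a, b, n=20000):
    """Composite Simpson rule on [a, b] using n (even) sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def reg_inc_beta(x, a, b):
    """I_x(a, b): ratio of the incomplete to the complete beta-function (a, b >= 1)."""
    complete = math.gamma(a) * math.gamma(b) / math.gamma(a + b)
    incomplete = simpson(lambda t: t ** (a - 1) * (1 - t) ** (b - 1), 0.0, x)
    return incomplete / complete


def partial_binomial_sum(x, m, p):
    """Sum of the first p terms of (1 - x)**(-m) expanded in powers of x."""
    total, term = 0.0, 1.0
    for k in range(p):
        total += term
        term *= (m + k) * x / (k + 1)
    return total


def lemma_rhs(x, m, p):
    """Right-hand side of (xv): (1 - x)**(-m) * I_{1-x}(m, p)."""
    return (1 - x) ** (-m) * reg_inc_beta(1 - x, m, p)


if __name__ == "__main__":
    print(partial_binomial_sum(0.3, 3, 4), lemma_rhs(0.3, 3, 4))
```

For $x = 0.3$, $m = 3$, $p = 4$ the direct sum is $1 + 0.9 + 0.54 + 0.27 = 2.71$, and the right-hand side of (xv) returns the same value to the accuracy of the quadrature.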

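Returning to Equation (xii) we have 

$$\int_0^\infty zY^s\,dY = \frac{M\gamma_1^{\tau_1+1}}{\Gamma(\tau_2+1)(\gamma_1+\gamma_2)^{\tau_1+1}\gamma_2^{\,s}}\int_0^\infty e^{-y}y^{\tau_2+s}\left\{1 + \frac{\tau_2(\tau_1+1)}{1!}\frac{\psi}{y} + \frac{\tau_2(\tau_2-1)(\tau_1+1)(\tau_1+2)}{2!}\frac{\psi^2}{y^2} + \dots + \frac{(\tau_1+\tau_2)!}{\tau_1!}\frac{\psi^{\tau_2}}{y^{\tau_2}}\right\}dy, \tag{xvi}$$

where $y = \gamma_2Y$ and $\psi = \gamma_2/(\gamma_1+\gamma_2)$. 

Integrating out with regard to y we have 

$$\int_0^\infty zY^s\,dY = \frac{M(1-\psi)^{\tau_1+1}}{\gamma_2^{\,s}\,\Gamma(\tau_2+1)}\left(\Gamma(\tau_2+s+1) + \frac{(\tau_1+1)\tau_2}{1!}\Gamma(\tau_2+s)\psi + \frac{(\tau_1+1)(\tau_1+2)}{2!}\tau_2(\tau_2-1)\Gamma(\tau_2+s-1)\psi^2 + \dots + \frac{(\tau_1+\tau_2)!}{\tau_1!}\Gamma(s+1)\psi^{\tau_2}\right). \tag{xvii}$$

First take $s = 0$, 

$$\int_0^\infty z\,dY = M(1-\psi)^{\tau_1+1}\times\text{sum of first } \tau_2+1 \text{ terms of the negative binomial } (1-\psi)^{-(\tau_1+1)}$$

$$= M\,I_{1-\psi}(\tau_1+1,\tau_2+1) = M\,I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+1,\tau_2+1),$$

by the Lemma just demonstrated. 

Hence the total area 

$$= M\,I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+1,\tau_2+1) + M\,I_{\frac{\gamma_2}{\gamma_1+\gamma_2}}(\tau_2+1,\tau_1+1)$$

$$= \frac{M}{B(\tau_1+1,\tau_2+1)}\left\{\int_0^{\frac{\gamma_1}{\gamma_1+\gamma_2}}x^{\tau_1}(1-x)^{\tau_2}\,dx + \int_0^{\frac{\gamma_2}{\gamma_1+\gamma_2}}x^{\tau_2}(1-x)^{\tau_1}\,dx\right\}.$$

In the last integral write $1 - x$ for x, and we have for the total area 

$$\frac{M}{B(\tau_1+1,\tau_2+1)}\left\{\int_0^{\frac{\gamma_1}{\gamma_1+\gamma_2}}x^{\tau_1}(1-x)^{\tau_2}\,dx + \int_{\frac{\gamma_1}{\gamma_1+\gamma_2}}^1 x^{\tau_1}(1-x)^{\tau_2}\,dx\right\},$$

but the sum of the two integrals in the curled brackets makes up $B(\tau_1+1,\tau_2+1)$. Hence 
the total area equals M as it should do. 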

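Next take $s = 1$. 

We have for moment about $Y = 0$, 

$$\mu_1' = \frac{(1-\psi)^{\tau_1+1}}{\gamma_2\,\Gamma(\tau_2+1)}\left\{\Gamma(\tau_2+2) + \frac{\tau_1+1}{1!}\tau_2\Gamma(\tau_2+1)\psi + \frac{(\tau_1+1)(\tau_1+2)}{2!}\tau_2(\tau_2-1)\Gamma(\tau_2)\psi^2 + \dots + \frac{(\tau_1+\tau_2)!}{\tau_1!}\Gamma(2)\psi^{\tau_2}\right\}$$

− (a similar expression with $\tau_1$ and $\tau_2$ interchanged and $\gamma_1$ and $\gamma_2$ also). 

The expression in curled brackets, call it Q, may be read as 

$$Q = \Gamma(\tau_2+1)\left((\tau_2+1) + \frac{\tau_1+1}{1!}(\tau_2+1-1)\psi + \frac{(\tau_1+1)(\tau_1+2)}{2!}(\tau_2+1-2)\psi^2 + \frac{(\tau_1+1)(\tau_1+2)(\tau_1+3)}{3!}(\tau_2+1-3)\psi^3 + \dots\right)$$

$$= \Gamma(\tau_2+1)\left((\tau_2+1)(1-\psi)^{-(\tau_1+1)}I_{1-\psi}(\tau_1+1,\tau_2+1) - (\tau_1+1)\,\psi\,(1-\psi)^{-(\tau_1+2)}I_{1-\psi}(\tau_1+2,\tau_2)\right).$$

Thus 

$$\mu_1' = \frac{\tau_2+1}{\gamma_2}I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+1,\tau_2+1) - \frac{\tau_1+1}{\gamma_1}I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+2,\tau_2) - \frac{\tau_1+1}{\gamma_1}I_{\frac{\gamma_2}{\gamma_1+\gamma_2}}(\tau_2+1,\tau_1+1) + \frac{\tau_2+1}{\gamma_2}I_{\frac{\gamma_2}{\gamma_1+\gamma_2}}(\tau_2+2,\tau_1)$$

$$= \frac{\tau_2+1}{\gamma_2}\left(I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+1,\tau_2+1) + I_{\frac{\gamma_2}{\gamma_1+\gamma_2}}(\tau_2+2,\tau_1)\right) - \frac{\tau_1+1}{\gamma_1}\left(I_{\frac{\gamma_2}{\gamma_1+\gamma_2}}(\tau_2+1,\tau_1+1) + I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+2,\tau_2)\right). \tag{xviii}$$

But 

$$I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+1,\tau_2+1) + I_{\frac{\gamma_2}{\gamma_1+\gamma_2}}(\tau_2+2,\tau_1) = \frac{\Gamma(\tau_1+\tau_2+2)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)}\left(\int_0^{\frac{\gamma_1}{\gamma_1+\gamma_2}}x^{\tau_1}(1-x)^{\tau_2}\,dx + \frac{\tau_1}{\tau_2+1}\int_0^{\frac{\gamma_2}{\gamma_1+\gamma_2}}x^{\tau_2+1}(1-x)^{\tau_1-1}\,dx\right),$$

or, integrating the second integral by parts, 

$$= \frac{\Gamma(\tau_1+\tau_2+2)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)}\left(\int_0^{\frac{\gamma_1}{\gamma_1+\gamma_2}}x^{\tau_1}(1-x)^{\tau_2}\,dx + \int_0^{\frac{\gamma_2}{\gamma_1+\gamma_2}}x^{\tau_2}(1-x)^{\tau_1}\,dx - \frac{\gamma_2}{\tau_2+1}\,\frac{\gamma_1^{\tau_1}\gamma_2^{\tau_2}}{(\gamma_1+\gamma_2)^{\tau_1+\tau_2+1}}\right).$$

Since the sum of the two integrals is as before the complete B-function, we 
find from (xviii) 

$$\mu_1' = \frac{\tau_2+1}{\gamma_2} - \frac{\Gamma(\tau_1+\tau_2+2)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)}\,\frac{\gamma_1^{\tau_1}\gamma_2^{\tau_2}}{(\gamma_1+\gamma_2)^{\tau_1+\tau_2+1}} - \frac{\tau_1+1}{\gamma_1} + \frac{\Gamma(\tau_1+\tau_2+2)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)}\,\frac{\gamma_1^{\tau_1}\gamma_2^{\tau_2}}{(\gamma_1+\gamma_2)^{\tau_1+\tau_2+1}},$$

or, since the second term in both lines is symmetrical in $\tau_1$, $\gamma_1$ and $\tau_2$, $\gamma_2$, 

$$\mu_1' = \frac{\tau_2+1}{\gamma_2} - \frac{\tau_1+1}{\gamma_1}. \tag{xix}$$

We have next to take $s = 2$ in (xvii), and have 

$$\mu_2' = \frac{(1-\psi)^{\tau_1+1}}{\gamma_2^2\,\Gamma(\tau_2+1)}\left(\Gamma(\tau_2+3) + \frac{\tau_1+1}{1!}\tau_2\Gamma(\tau_2+2)\psi + \frac{(\tau_1+1)(\tau_1+2)}{2!}\tau_2(\tau_2-1)\Gamma(\tau_2+1)\psi^2 + \frac{(\tau_1+1)(\tau_1+2)(\tau_1+3)}{3!}\tau_2(\tau_2-1)(\tau_2-2)\Gamma(\tau_2)\psi^3 + \dots \text{ up to } \psi^{\tau_2}\right)$$

+ a similar series with interchanges of $\tau_1$, $\tau_2$ and $\gamma_1$, $\gamma_2$ 

$$= \frac{(1-\psi)^{\tau_1+1}}{\gamma_2^2}\left((\tau_2+2)(\tau_2+1) + \frac{\tau_1+1}{1!}(\tau_2+1)\tau_2\,\psi + \frac{(\tau_1+1)(\tau_1+2)}{2!}\tau_2(\tau_2-1)\psi^2 + \frac{(\tau_1+1)(\tau_1+2)(\tau_1+3)}{3!}(\tau_2-1)(\tau_2-2)\psi^3 + \dots \text{ up to } \psi^{\tau_2}\right)$$

+ a similar series with proper interchanges. 

Replacing $(\tau_2+1)\tau_2$ by $(\tau_2+2-1)(\tau_2+1-1)$, $\tau_2(\tau_2-1)$ by $(\tau_2+2-2)(\tau_2+1-2)$, 
$(\tau_2-1)(\tau_2-2)$ by $(\tau_2+2-3)(\tau_2+1-3)$ etc., we can rewrite our $\mu_2'$ as 

$$\mu_2' = \frac{(1-\psi)^{\tau_1+1}}{\gamma_2^2}\Big[(\tau_2+2)(\tau_2+1)\Big\{1 + \frac{\tau_1+1}{1!}\psi + \frac{(\tau_1+1)(\tau_1+2)}{2!}\psi^2 + \dots \text{ to } \psi^{\tau_2}\Big\}$$

$$\qquad - (2\tau_2+3)(\tau_1+1)\psi\Big\{1 + \frac{\tau_1+2}{1!}\psi + \frac{(\tau_1+2)(\tau_1+3)}{2!}\psi^2 + \dots \text{ to } \psi^{\tau_2-1}\Big\}$$

$$\qquad + (\tau_1+1)\psi\Big\{1 + 2\,\frac{\tau_1+2}{1!}\psi + 3\,\frac{(\tau_1+2)(\tau_1+3)}{2!}\psi^2 + \dots \text{ to } \psi^{\tau_2-1}\Big\}\Big]$$

+ a similar series with proper interchanges. 

The first series in curled brackets is $(1-\psi)^{-(\tau_1+1)}I_{1-\psi}(\tau_1+1,\tau_2+1)$; the second 
series in curled brackets is $(1-\psi)^{-(\tau_1+2)}I_{1-\psi}(\tau_1+2,\tau_2)$. 

To obtain the third series we note that, if 

$$\chi = 1 + \frac{\tau_1+2}{1!}\psi + \frac{(\tau_1+2)(\tau_1+3)}{2!}\psi^2 + \dots \text{ up to } \psi^{\tau_2-1} = (1-\psi)^{-(\tau_1+2)}I_{1-\psi}(\tau_1+2,\tau_2),$$

then the third series 

$$= \frac{d(\psi\chi)}{d\psi} = (1-\psi)^{-(\tau_1+2)}I_{1-\psi}(\tau_1+2,\tau_2) + (\tau_1+2)\,\psi\,(1-\psi)^{-(\tau_1+3)}I_{1-\psi}(\tau_1+2,\tau_2) - \frac{\psi^{\tau_2}(1-\psi)^{-1}}{B(\tau_1+2,\tau_2)}.$$

Substituting 

$$\mu_2' = \frac{1}{\gamma_2^2}\Big[(\tau_2+2)(\tau_2+1)\,I_{1-\psi}(\tau_1+1,\tau_2+1) - (2\tau_2+3)(\tau_1+1)\frac{\psi}{1-\psi}I_{1-\psi}(\tau_1+2,\tau_2) + (\tau_1+1)\frac{\psi}{1-\psi}I_{1-\psi}(\tau_1+2,\tau_2)$$

$$\qquad + (\tau_1+1)(\tau_1+2)\frac{\psi^2}{(1-\psi)^2}I_{1-\psi}(\tau_1+2,\tau_2) - \frac{\psi^{\tau_2+1}(1-\psi)^{\tau_1}\,\Gamma(\tau_1+\tau_2+2)}{\Gamma(\tau_2)\,\Gamma(\tau_1+1)}\Big]$$

+ a similar series with proper interchanges, 

$$= \frac{(\tau_2+2)(\tau_2+1)}{\gamma_2^2}I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+1,\tau_2+1) - \frac{2(\tau_2+1)(\tau_1+1)}{\gamma_1\gamma_2}I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+2,\tau_2) + \frac{(\tau_1+1)(\tau_1+2)}{\gamma_1^2}I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+2,\tau_2)$$

$$\qquad - \frac{\gamma_2^{\tau_2}\gamma_1^{\tau_1}}{(\gamma_1+\gamma_2)^{\tau_1+\tau_2+1}}\,\frac{\Gamma(\tau_1+\tau_2+2)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)}\,\frac{\tau_2}{\gamma_2}$$

+ a similar series with proper interchanges. 

We can now combine the two series and find 

$$\mu_2' = \frac{(\tau_2+1)(\tau_2+2)}{\gamma_2^2}\left\{I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+1,\tau_2+1) + I_{\frac{\gamma_2}{\gamma_1+\gamma_2}}(\tau_2+2,\tau_1)\right\} - \frac{2(\tau_2+1)(\tau_1+1)}{\gamma_1\gamma_2}\left\{I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+2,\tau_2) + I_{\frac{\gamma_2}{\gamma_1+\gamma_2}}(\tau_2+2,\tau_1)\right\}$$

$$\qquad + \frac{(\tau_1+1)(\tau_1+2)}{\gamma_1^2}\left\{I_{\frac{\gamma_2}{\gamma_1+\gamma_2}}(\tau_2+1,\tau_1+1) + I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+2,\tau_2)\right\} - \frac{\gamma_1^{\tau_1}\gamma_2^{\tau_2}}{(\gamma_1+\gamma_2)^{\tau_1+\tau_2+1}}\,\frac{\Gamma(\tau_1+\tau_2+2)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)}\left(\frac{\tau_1}{\gamma_1}+\frac{\tau_2}{\gamma_2}\right). \tag{xx}$$

But we have already seen that 

$$I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+1,\tau_2+1) + I_{\frac{\gamma_2}{\gamma_1+\gamma_2}}(\tau_2+2,\tau_1) = 1 - \frac{\Gamma(\tau_1+\tau_2+2)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)}\,\frac{\gamma_1^{\tau_1}\gamma_2^{\tau_2}}{(\gamma_1+\gamma_2)^{\tau_1+\tau_2+1}}\,\frac{\gamma_2}{\tau_2+1}.$$

Similarly 

$$I_{\frac{\gamma_2}{\gamma_1+\gamma_2}}(\tau_2+1,\tau_1+1) + I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+2,\tau_2) = 1 - \frac{\Gamma(\tau_1+\tau_2+2)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)}\,\frac{\gamma_1^{\tau_1}\gamma_2^{\tau_2}}{(\gamma_1+\gamma_2)^{\tau_1+\tau_2+1}}\,\frac{\gamma_1}{\tau_1+1}.$$

Again integrating by parts one finds 

$$I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+2,\tau_2) = -\frac{\gamma_1}{\tau_1+1}\,\frac{\gamma_1^{\tau_1}\gamma_2^{\tau_2}}{(\gamma_1+\gamma_2)^{\tau_1+\tau_2+1}}\,\frac{\Gamma(\tau_1+\tau_2+2)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)} + I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+1,\tau_2+1),$$

and similarly 

$$I_{\frac{\gamma_2}{\gamma_1+\gamma_2}}(\tau_2+2,\tau_1) = -\frac{\gamma_2}{\tau_2+1}\,\frac{\gamma_1^{\tau_1}\gamma_2^{\tau_2}}{(\gamma_1+\gamma_2)^{\tau_1+\tau_2+1}}\,\frac{\Gamma(\tau_1+\tau_2+2)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)} + I_{\frac{\gamma_2}{\gamma_1+\gamma_2}}(\tau_2+1,\tau_1+1).$$

Substituting in (xx) and remembering that 

$$I_{\frac{\gamma_1}{\gamma_1+\gamma_2}}(\tau_1+1,\tau_2+1) + I_{\frac{\gamma_2}{\gamma_1+\gamma_2}}(\tau_2+1,\tau_1+1) = 1,$$

we find 

$$\mu_2' = \frac{(\tau_2+1)(\tau_2+2)}{\gamma_2^2} - \frac{2(\tau_2+1)(\tau_1+1)}{\gamma_1\gamma_2} + \frac{(\tau_1+1)(\tau_1+2)}{\gamma_1^2}$$

$$\qquad - \frac{\gamma_1^{\tau_1}\gamma_2^{\tau_2}}{(\gamma_1+\gamma_2)^{\tau_1+\tau_2+1}}\,\frac{\Gamma(\tau_1+\tau_2+2)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)}\left\{\frac{\tau_2+2}{\gamma_2} + \frac{\tau_1+2}{\gamma_1} - \frac{2(\tau_2+1)}{\gamma_2} - \frac{2(\tau_1+1)}{\gamma_1} + \frac{\tau_2}{\gamma_2} + \frac{\tau_1}{\gamma_1}\right\},$$

and since the last term in curled brackets vanishes, 

$$\mu_2' = \frac{(\tau_2+1)(\tau_2+2)}{\gamma_2^2} - \frac{2(\tau_2+1)(\tau_1+1)}{\gamma_1\gamma_2} + \frac{(\tau_1+1)(\tau_1+2)}{\gamma_1^2}. \tag{xxi}$$

Subtract $\mu_1'^2$ and we have 

$$\mu_2 = \frac{\tau_2+1}{\gamma_2^2} + \frac{\tau_1+1}{\gamma_1^2}.$$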

Putting $s = 3$ in (xvii) and proceeding in the same manner, we find after 
considerable algebra 

$$\mu_3' = \frac{(\tau_2+3)(\tau_2+2)(\tau_2+1)}{\gamma_2^3} - \frac{3(\tau_2+2)(\tau_2+1)(\tau_1+1)}{\gamma_2^2\gamma_1} + \frac{3(\tau_2+1)(\tau_1+1)(\tau_1+2)}{\gamma_2\gamma_1^2} - \frac{(\tau_1+3)(\tau_1+2)(\tau_1+1)}{\gamma_1^3}. \tag{xxii}$$

Hence transferring to the mean we find 

$$\mu_3 = 2\left(\frac{\tau_2+1}{\gamma_2^3} - \frac{\tau_1+1}{\gamma_1^3}\right). \tag{xxiii}$$

The algebraic labour for $\mu_4'$ and its transference to the mean is so considerable that 
it is desirable to use the characteristic function of the distribution. 


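We have first to find 

$$\int_{-\infty}^{+\infty} z\,e^{\omega Y}\,dY,$$

and this by (xii) 

$$= \frac{M\gamma_1^{\tau_1+1}\gamma_2^{\tau_2+1}}{\Gamma(\tau_2+1)(\gamma_1+\gamma_2)^{\tau_1+1}}\int_0^\infty e^{-(\gamma_2-\omega)Y}Y^{\tau_2}\left(1 + \frac{\tau_2(\tau_1+1)}{1!\,(\gamma_1+\gamma_2)Y} + \frac{\tau_2(\tau_2-1)(\tau_1+1)(\tau_1+2)}{2!\,(\gamma_1+\gamma_2)^2Y^2} + \dots + \frac{\tau_2!\,(\tau_1+\tau_2)!}{\tau_2!\,\tau_1!\,(\gamma_1+\gamma_2)^{\tau_2}Y^{\tau_2}}\right)dY$$

+ a similar series with the proper interchanges. 

Write $(\gamma_2-\omega)Y = y$, and we have for the characteristic function $F(\omega)$, 

$$F(\omega) = \frac{M\gamma_1^{\tau_1+1}\gamma_2^{\tau_2+1}}{\Gamma(\tau_2+1)(\gamma_1+\gamma_2)^{\tau_1+1}(\gamma_2-\omega)^{\tau_2+1}}\int_0^\infty e^{-y}y^{\tau_2}\Big\{1 + \frac{\tau_2(\tau_1+1)}{1!\,(\gamma_1+\gamma_2)}\Big(\frac{\gamma_2-\omega}{y}\Big) + \frac{\tau_2(\tau_2-1)(\tau_1+1)(\tau_1+2)}{2!\,(\gamma_1+\gamma_2)^2}\Big(\frac{\gamma_2-\omega}{y}\Big)^2$$

$$\qquad + \dots + \frac{\tau_2!\,(\tau_1+\tau_2)!}{\tau_2!\,\tau_1!\,(\gamma_1+\gamma_2)^{\tau_2}}\Big(\frac{\gamma_2-\omega}{y}\Big)^{\tau_2}\Big\}dy + \text{a similar series},$$

or, 

$$F(\omega) = \frac{M\gamma_1^{\tau_1+1}\gamma_2^{\tau_2+1}}{(\gamma_1+\gamma_2)^{\tau_1+1}(\gamma_2-\omega)^{\tau_2+1}}\left(1 - \frac{\gamma_2-\omega}{\gamma_1+\gamma_2}\right)^{-(\tau_1+1)}I_{\frac{\gamma_1+\omega}{\gamma_1+\gamma_2}}(\tau_1+1,\tau_2+1)$$

+ a similar series with $\omega$ changed to $-\omega$, since Y becomes $-Y$, and 
$\tau_1$, $\gamma_1$ changed to $\tau_2$, $\gamma_2$, etc. 

Hence 

$$F(\omega) = \frac{M}{\left(1-\dfrac{\omega}{\gamma_2}\right)^{\tau_2+1}\left(1+\dfrac{\omega}{\gamma_1}\right)^{\tau_1+1}}\,I_{\frac{\gamma_1+\omega}{\gamma_1+\gamma_2}}(\tau_1+1,\tau_2+1) + \frac{M}{\left(1-\dfrac{\omega}{\gamma_2}\right)^{\tau_2+1}\left(1+\dfrac{\omega}{\gamma_1}\right)^{\tau_1+1}}\,I_{\frac{\gamma_2-\omega}{\gamma_1+\gamma_2}}(\tau_2+1,\tau_1+1).$$

But the sum of the two incomplete B-function ratios is unity. Accordingly the 
characteristic function* 

$$F(\omega) = \frac{1}{\left(1-\dfrac{\omega}{\gamma_2}\right)^{\tau_2+1}\left(1+\dfrac{\omega}{\gamma_1}\right)^{\tau_1+1}}. \tag{xxiv}$$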

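Now the logarithm of the characteristic function is given by 

$$\log F(\omega) = \frac{\lambda_1\omega}{1!} + \frac{\lambda_2\omega^2}{2!} + \dots + \frac{\lambda_s\omega^s}{s!} + \dots, \tag{xxv}$$

where $\lambda_1, \lambda_2, \dots \lambda_s, \dots$ are the semi-invariants. Thus 

$$\frac{\lambda_1\omega}{1!} + \frac{\lambda_2\omega^2}{2!} + \dots + \frac{\lambda_s\omega^s}{s!} + \dots = -(\tau_2+1)\log\left(1-\frac{\omega}{\gamma_2}\right) - (\tau_1+1)\log\left(1+\frac{\omega}{\gamma_1}\right)$$

$$= (\tau_2+1)\left(\frac{\omega}{\gamma_2} + \frac12\frac{\omega^2}{\gamma_2^2} + \frac13\frac{\omega^3}{\gamma_2^3} + \dots + \frac1s\frac{\omega^s}{\gamma_2^s} + \dots\right) - (\tau_1+1)\left(\frac{\omega}{\gamma_1} - \frac12\frac{\omega^2}{\gamma_1^2} + \frac13\frac{\omega^3}{\gamma_1^3} - \dots + \frac{(-1)^{s-1}}{s}\frac{\omega^s}{\gamma_1^s} + \dots\right).$$

Equating powers of $\omega$ we have 

$$\lambda_1 = \mu_1' = \frac{\tau_2+1}{\gamma_2} - \frac{\tau_1+1}{\gamma_1}, \qquad \lambda_2 = \mu_2 = \frac{\tau_2+1}{\gamma_2^2} + \frac{\tau_1+1}{\gamma_1^2},$$

$$\lambda_3 = \mu_3 = 2!\left(\frac{\tau_2+1}{\gamma_2^3} - \frac{\tau_1+1}{\gamma_1^3}\right), \qquad \lambda_4 = \mu_4 - 3\mu_2^2 = 3!\left(\frac{\tau_2+1}{\gamma_2^4} + \frac{\tau_1+1}{\gamma_1^4}\right), \tag{xxvi}$$

and so on. 

* The area M of the curve must be made unity to obtain the characteristic function. 

The first three semi-invariants check with the values found directly for the 
first three moments. 

The above equations suffice to give the mean, standard deviation and $\beta_1$, $\beta_2$ for 
the distribution of the difference of any two statistical coefficients, satisfying 
equations of Type III, when these coefficients are measured from the start of their 
distributions. 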
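
The semi-invariants can be read off from (xxv) in the general form $\lambda_s = (s-1)!\,\{(\tau_2+1)/\gamma_2^s + (-1)^s(\tau_1+1)/\gamma_1^s\}$, and a small simulation gives a rough check. The sketch below is an illustration only: the function names, the trial constants, the sample size and the seed are choices made here, and the standard-library generator `random.gammavariate(alpha, beta)` is used with shape $\tau+1$ and scale $1/\gamma$ to draw the two Type III variates.

```python
import math
import random


def semi_invariant(s, t1, t2, g1, g2):
    """lambda_s of Y = v - u, as read off from the expansion of log F(omega)."""
    return math.factorial(s - 1) * ((t2 + 1) / g2 ** s
                                    + (-1) ** s * (t1 + 1) / g1 ** s)


def betas(t1, t2, g1, g2):
    """beta_1 = mu_3^2 / mu_2^3 and beta_2 = mu_4 / mu_2^2 from the semi-invariants."""
    l2, l3, l4 = (semi_invariant(s, t1, t2, g1, g2) for s in (2, 3, 4))
    return l3 ** 2 / l2 ** 3, 3 + l4 / l2 ** 2


def simulate(t1, t2, g1, g2, n=100_000, seed=1):
    """Sample mean and second and third central moments of v - u."""
    rng = random.Random(seed)
    ys = [rng.gammavariate(t2 + 1, 1 / g2) - rng.gammavariate(t1 + 1, 1 / g1)
          for _ in range(n)]
    mean = sum(ys) / n
    var = sum((y - mean) ** 2 for y in ys) / n
    third = sum((y - mean) ** 3 for y in ys) / n
    return mean, var, third


if __name__ == "__main__":
    pars = (2, 3, 1.5, 2.0)  # tau1, tau2, gamma1, gamma2 (trial values)
    print([semi_invariant(s, *pars) for s in (1, 2, 3, 4)])
    print(betas(*pars))
    print(simulate(*pars))
```

For the trial values shown the formulae give a mean of zero, a variance of 7/3 and a third moment of about −0.778, and the simulated moments should agree with these within sampling error.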


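(iv) The previous investigation will not only have made it clear that we are 
dealing with a function closely allied to the $K_m(x)$ Bessel Function in a generalised 
form, but also that it is desirable to work out some of its properties. We may write 
our curve in the form 

$$z = \frac{z_0}{c_0}\left(\frac{2}{\gamma_1+\gamma_2}\right)^{2\nu}e^{-\frac12(\gamma_2-\gamma_1)Y}\left(\tfrac12(\gamma_1+\gamma_2)Y\right)^{\nu}\mathfrak{K}_{\tau_1,\tau_2}\left(\tfrac12(\gamma_1+\gamma_2)Y\right), \tag{xxvii}$$

where, if $\nu = \frac12(\tau_1+\tau_2+1)$, 

$$\mathfrak{K}_{\tau_1,\tau_2}\left(\tfrac12(\gamma_1+\gamma_2)Y\right) = c_0\left(\tfrac12(\gamma_1+\gamma_2)Y\right)^{\nu}\int_1^\infty e^{-\frac12(\gamma_1+\gamma_2)Yt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt$$

(see Equation (v)), $c_0$ being a constant. 

Thus 

$$\mathfrak{K}_{\tau_1,\tau_2}(x) = c_0\,x^{\nu}\int_1^\infty e^{-xt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt.$$

If we choose $c_0$ so that when $\tau_1 = \tau_2$ we have $\mathfrak{K}_{\tau_1,\tau_1}(x) = K_\nu(x)$, where $\nu$ is now 
$\tau_1 + \frac12$, we see that our $\mathfrak{K}_{\tau_1,\tau_2}(x)$ will pass over into the Bessel Function $K_\nu(x)$; 
this requires us to take $c_0 = \sqrt{\pi}/(2^{\nu}\Gamma(\nu+\frac12))$. Thus our curve becomes 

$$z = \frac{M(\gamma_1+\gamma_2)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)}\left(\frac{\gamma_1}{\gamma_1+\gamma_2}\right)^{\tau_1+1}\left(\frac{\gamma_2}{\gamma_1+\gamma_2}\right)^{\tau_2+1}\frac{2^{\nu}\,\Gamma(\nu+\frac12)}{\Gamma(\frac12)}\,e^{-\rho\frac12(\gamma_1+\gamma_2)Y}\left(\tfrac12(\gamma_1+\gamma_2)Y\right)^{\nu}\mathfrak{K}_{\tau_1,\tau_2}\left(\tfrac12(\gamma_1+\gamma_2)Y\right), \tag{xxviii}$$

where $\rho = (\gamma_2-\gamma_1)/(\gamma_1+\gamma_2)$. 

I propose to call 

$$\mathfrak{K}_{\tau_1,\tau_2}(x) = \frac{\sqrt{\pi}\,x^{\nu}}{2^{\nu}\,\Gamma(\nu+\frac12)}\int_1^\infty e^{-xt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt \qquad (\nu = \tfrac12(\tau_1+\tau_2+1)) \tag{xxix}$$

the Double Bessel Function*, and to study some of its properties. We may first 
find the differential equation which it satisfies. 

* Of course only a Double Bessel Function of the second order and imaginary argument. 

Write for brevity $u = \mathfrak{K}_{\tau_1,\tau_2}(x)$, and differentiate twice, 

$$\frac{du}{dx} = \frac{\nu}{x}u - \frac{\sqrt{\pi}\,x^{\nu}}{2^{\nu}\Gamma(\nu+\frac12)}\int_1^\infty e^{-xt}\,t\,(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt,$$

$$\frac{d^2u}{dx^2} = \frac{\nu(\nu-1)}{x^2}u - \frac{2\nu\sqrt{\pi}\,x^{\nu-1}}{2^{\nu}\Gamma(\nu+\frac12)}\int_1^\infty e^{-xt}\,t\,(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt + \frac{\sqrt{\pi}\,x^{\nu}}{2^{\nu}\Gamma(\nu+\frac12)}\int_1^\infty e^{-xt}\,t^2\,(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt.$$

Accordingly 

$$x^2\frac{d^2u}{dx^2} + x\frac{du}{dx} = \nu^2u + \frac{\sqrt{\pi}\,x^{\nu+2}}{2^{\nu}\Gamma(\nu+\frac12)}\int_1^\infty e^{-xt}(t^2-1)(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt - (2\nu+1)\frac{\sqrt{\pi}\,x^{\nu+1}}{2^{\nu}\Gamma(\nu+\frac12)}\int_1^\infty e^{-xt}\,t\,(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt + x^2u.$$

Now 

$$\int_1^\infty e^{-xt}(t^2-1)(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt = \frac1x\int_1^\infty e^{-xt}\,\frac{d}{dt}\left\{(t-1)^{\tau_1+1}(t+1)^{\tau_2+1}\right\}dt$$

$$= \frac1x\int_1^\infty e^{-xt}\{(\tau_1+1)(t+1) + (\tau_2+1)(t-1)\}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt,$$

since the term between limits vanishes, 

$$= \frac1x\left\{(2\nu+1)\int_1^\infty e^{-xt}\,t\,(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt + (\tau_1-\tau_2)\int_1^\infty e^{-xt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt\right\}.$$

Substituting we have* 

$$x^2\frac{d^2u}{dx^2} + x\frac{du}{dx} - u\left(\nu^2 + (\tau_1-\tau_2)x + x^2\right) = 0. \tag{xxx}$$

This is the differential equation satisfied by $\mathfrak{K}_{\tau_1,\tau_2}(x)$. It may also be written 

$$x^2\frac{d^2u}{dx^2} + x\frac{du}{dx} - u\left((\tau_1+\tfrac12)(\tau_2+\tfrac12) + \left(x+\tfrac12(\tau_1-\tau_2)\right)^2\right) = 0, \tag{xxx bis}$$

which shows its relation to $K_{\tau+\frac12}(x)$ when $\tau_1 = \tau_2$, for $K_{\tau+\frac12}$ satisfies the equation 

$$x^2\frac{d^2u}{dx^2} + x\frac{du}{dx} - u\left((\tau+\tfrac12)^2 + x^2\right) = 0.$$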
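
The differential equation (xxx) may be checked numerically by following the same steps as the derivation: the function and its first two differential coefficients are built from the three integrals containing $1$, $t$ and $t^2$, and the left-hand side of (xxx) is then evaluated. The sketch below is only an illustration; the quadrature, the substitution $t = 1 + s^2$ (which keeps the integrand smooth when $2\tau_1$ is a whole number), the truncation of the range and the trial values are all assumptions made here.

```python
import math


def simpson(f, a, b, n=4000):
    """Composite Simpson rule on [a, b] using n (even) sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def double_bessel_parts(x, t1, t2):
    """Return nu, u, du/dx, d2u/dx2 for u = K_{t1,t2}(x) as defined in (xxix)."""
    nu = 0.5 * (t1 + t2 + 1)
    c = math.sqrt(math.pi) / (2 ** nu * math.gamma(nu + 0.5))
    upper = math.sqrt(60.0 / x)  # exp(-x s^2) is negligible beyond this point

    def J(k):
        # integral of exp(-x t) t^k (t-1)^t1 (t+1)^t2 over t >= 1, with t = 1 + s^2
        return simpson(
            lambda s: (math.exp(-x * (1 + s * s)) * (1 + s * s) ** k
                       * s ** (2 * t1 + 1) * (2 + s * s) ** t2 * 2),
            0.0, upper)

    j0, j1, j2 = J(0), J(1), J(2)
    u = c * x ** nu * j0
    du = nu / x * u - c * x ** nu * j1
    d2u = (nu * (nu - 1) / x ** 2 * u
           - 2 * nu * c * x ** (nu - 1) * j1 + c * x ** nu * j2)
    return nu, u, du, d2u


def relative_residual(x, t1, t2):
    """Left-hand side of (xxx), scaled by (nu^2 + x^2) u."""
    nu, u, du, d2u = double_bessel_parts(x, t1, t2)
    lhs = x * x * d2u + x * du - u * (nu * nu + (t1 - t2) * x + x * x)
    return lhs / ((nu * nu + x * x) * u)


if __name__ == "__main__":
    print(relative_residual(1.3, 1.5, 0.5), relative_residual(0.7, 2, 1))
```

The scaled residual should be zero to within the error of the quadrature for any admissible $\tau_1$, $\tau_2$ and positive $x$.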


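(v) We next turn to the recurrence formulae. Consider the expression 

$$I = \int_1^\infty e^{-xt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt.$$

Integrating by parts, since the part between limits vanishes, we have 

$$I = \frac1x\int_1^\infty e^{-xt}\,\frac{d}{dt}\left\{(t-1)^{\tau_1}(t+1)^{\tau_2}\right\}dt.$$

Differentiating out, 

$$I = \frac1x\int_1^\infty e^{-xt}\{\tau_1(t+1) + \tau_2(t-1)\}(t-1)^{\tau_1-1}(t+1)^{\tau_2-1}\,dt$$

$$= \frac1x\left\{(2\nu-1)\int_1^\infty e^{-xt}\,t\,(t-1)^{\tau_1-1}(t+1)^{\tau_2-1}\,dt + (\tau_1-\tau_2)\int_1^\infty e^{-xt}(t-1)^{\tau_1-1}(t+1)^{\tau_2-1}\,dt\right\}.$$

Accordingly 

$$\int_1^\infty e^{-xt}\,t\,(t-1)^{\tau_1-1}(t+1)^{\tau_2-1}\,dt = \frac{x}{2\nu-1}\int_1^\infty e^{-xt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt - \frac{\tau_1-\tau_2}{2\nu-1}\int_1^\infty e^{-xt}(t-1)^{\tau_1-1}(t+1)^{\tau_2-1}\,dt. \tag{xxxi}$$

Multiply both sides by $\sqrt{\pi}\,x^{\nu-1}/(2^{\nu-1}\Gamma(\nu-\frac12))$ and express the right-hand side in 
terms of the $\mathfrak{K}_{\tau_1,\tau_2}$ functions. We have 

$$\frac{\sqrt{\pi}\,x^{\nu-1}}{2^{\nu-1}\Gamma(\nu-\frac12)}\int_1^\infty e^{-xt}\,t\,(t-1)^{\tau_1-1}(t+1)^{\tau_2-1}\,dt = \mathfrak{K}_{\tau_1,\tau_2}(x) - \frac{\tau_1-\tau_2}{2\nu-1}\,\mathfrak{K}_{\tau_1-1,\tau_2-1}(x). \tag{xxxii}$$

* The other solution of this equation is the Double Bessel Function $J_{\tau_1,\tau_2}(x)$. 

The left-hand side may also be expressed in terms of $\mathfrak{K}_{\tau_1-1,\tau_2}(x)$ and $\mathfrak{K}_{\tau_1,\tau_2-1}(x)$, being 

$$\frac{\Gamma(\nu)}{\Gamma(\nu-\frac12)}\,\frac{1}{\sqrt{2x}}\left(\mathfrak{K}_{\tau_1-1,\tau_2}(x) + \mathfrak{K}_{\tau_1,\tau_2-1}(x)\right),$$

so that we have 

$$\mathfrak{K}_{\tau_1,\tau_2}(x) = \frac{\Gamma(\nu)}{\Gamma(\nu-\frac12)}\,\frac{1}{\sqrt{2x}}\left(\mathfrak{K}_{\tau_1-1,\tau_2}(x) + \mathfrak{K}_{\tau_1,\tau_2-1}(x)\right) + \frac{\tau_1-\tau_2}{2\nu-1}\,\mathfrak{K}_{\tau_1-1,\tau_2-1}(x),$$

from which a table of $\mathfrak{K}_{\tau_1,\tau_2}(x)$ could be fairly readily computed. 

Next consider $\mathfrak{K}_{\tau_1+1,\tau_2+1}(x)$; we have 

$$\mathfrak{K}_{\tau_1+1,\tau_2+1}(x) = \frac{\sqrt{\pi}\,x^{\nu+1}}{2^{\nu+1}\Gamma(\nu+\frac32)}\int_1^\infty e^{-xt}(t-1)^{\tau_1+1}(t+1)^{\tau_2+1}\,dt$$

$$= \frac{\sqrt{\pi}\,x^{\nu+1}}{2^{\nu+1}\Gamma(\nu+\frac32)}\int_1^\infty\frac{d}{dt}\left(-\frac{e^{-xt}}{x}\right)(t-1)^{\tau_1+1}(t+1)^{\tau_2+1}\,dt,$$

or integrating by parts, 

$$= \frac{\sqrt{\pi}\,x^{\nu}}{2^{\nu+1}\Gamma(\nu+\frac32)}\int_1^\infty e^{-xt}\{(\tau_1+1)(t+1) + (\tau_2+1)(t-1)\}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt \tag{xxxiii}$$

$$= \frac{\sqrt{\pi}\,x^{\nu}}{2^{\nu+1}\Gamma(\nu+\frac32)}\left[(2\nu+1)\int_1^\infty e^{-xt}\,t\,(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt + (\tau_1-\tau_2)\int_1^\infty e^{-xt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt\right]$$

$$= \frac{\sqrt{\pi}\,x^{\nu}}{2^{\nu}\Gamma(\nu+\frac12)}\int_1^\infty e^{-xt}\,t\,(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt + \frac{\tau_1-\tau_2}{2\nu+1}\,\mathfrak{K}_{\tau_1,\tau_2}(x). \tag{xxxiv}$$

Again 

$$\int_1^\infty e^{-xt}\,t\,(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt = -\frac1x\int_1^\infty\frac{d\,e^{-xt}}{dt}\,t\,(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt$$

$$= \frac1x\int_1^\infty e^{-xt}\left[(t-1)^{\tau_1}(t+1)^{\tau_2} + \{\tau_1t(t+1) + \tau_2t(t-1)\}(t-1)^{\tau_1-1}(t+1)^{\tau_2-1}\right]dt$$

$$= \frac1x\Big[\int_1^\infty e^{-xt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt + (2\nu-1)\int_1^\infty e^{-xt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt$$

$$\qquad + (\tau_1-\tau_2)\int_1^\infty e^{-xt}\,t\,(t-1)^{\tau_1-1}(t+1)^{\tau_2-1}\,dt + (2\nu-1)\int_1^\infty e^{-xt}(t-1)^{\tau_1-1}(t+1)^{\tau_2-1}\,dt\Big].$$

Multiply both sides of this equation by $\sqrt{\pi}\,x^{\nu}/(2^{\nu}\Gamma(\nu+\frac12))$, and we have 

$$\frac{\sqrt{\pi}\,x^{\nu}}{2^{\nu}\Gamma(\nu+\frac12)}\int_1^\infty e^{-xt}\,t\,(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt = \frac{2\nu}{x}\,\mathfrak{K}_{\tau_1,\tau_2}(x) + \mathfrak{K}_{\tau_1-1,\tau_2-1}(x) + \frac{\tau_1-\tau_2}{2\nu-1}\cdot\frac{\sqrt{\pi}\,x^{\nu-1}}{2^{\nu-1}\Gamma(\nu-\frac12)}\int_1^\infty e^{-xt}\,t\,(t-1)^{\tau_1-1}(t+1)^{\tau_2-1}\,dt,$$

and thus by (xxxii), 

$$= \frac{2\nu}{x}\,\mathfrak{K}_{\tau_1,\tau_2}(x) + \frac{\tau_1-\tau_2}{2\nu-1}\,\mathfrak{K}_{\tau_1,\tau_2}(x) - \left(\frac{\tau_1-\tau_2}{2\nu-1}\right)^2\mathfrak{K}_{\tau_1-1,\tau_2-1}(x) + \mathfrak{K}_{\tau_1-1,\tau_2-1}(x). \tag{xxxv}$$

Hence substituting (xxxv) in (xxxiv), 

$$\mathfrak{K}_{\tau_1+1,\tau_2+1}(x) = \mathfrak{K}_{\tau_1-1,\tau_2-1}(x) + \left(\frac{2\nu}{x} + \frac{\tau_1-\tau_2}{2\nu-1}\right)\mathfrak{K}_{\tau_1,\tau_2}(x) - \left(\frac{\tau_1-\tau_2}{2\nu-1}\right)^2\mathfrak{K}_{\tau_1-1,\tau_2-1}(x) + \frac{\tau_1-\tau_2}{2\nu+1}\,\mathfrak{K}_{\tau_1,\tau_2}(x).$$

Or finally, 

$$\mathfrak{K}_{\tau_1+1,\tau_2+1}(x) - \mathfrak{K}_{\tau_1-1,\tau_2-1}(x) = \frac{2\nu}{x}\left(1 + \frac{2(\tau_1-\tau_2)x}{4\nu^2-1}\right)\mathfrak{K}_{\tau_1,\tau_2}(x) - \left(\frac{\tau_1-\tau_2}{2\nu-1}\right)^2\mathfrak{K}_{\tau_1-1,\tau_2-1}(x). \tag{xxxvi}$$

This formula reduces, when $\tau_1 = \tau_2$, to the familiar and much simpler one 

$$K_{\nu+1}(x) - K_{\nu-1}(x) = \frac{2\nu}{x}K_\nu(x),$$

for the Bessel Function of second order and imaginary argument*. By means of 
(xxxvi) we express $\mathfrak{K}_{\tau_1+1,\tau_2+1}(x)$ in terms of $\mathfrak{K}_{\tau_1,\tau_2}(x)$ and $\mathfrak{K}_{\tau_1-1,\tau_2-1}(x)$. We shall 
find it convenient to use the symbol $T_{\tau_1,\tau_2}(x)$ in the following manner: 

$$T_{\tau_1,\tau_2}(x) = \frac{x^{\nu}\,\mathfrak{K}_{\tau_1,\tau_2}(x)}{2^{\nu}\sqrt{\pi}\,\Gamma(\nu+\frac12)}. \tag{xxxvii}$$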


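Then by (xxvii) if $Y' = \frac12(\gamma_1+\gamma_2)Y$ and $\rho = (\gamma_2-\gamma_1)/(\gamma_1+\gamma_2)$, 
and by (iv), 

$$z = M(\gamma_1+\gamma_2)\left(\frac{\gamma_1}{\gamma_1+\gamma_2}\right)^{\tau_1+1}\left(\frac{\gamma_2}{\gamma_1+\gamma_2}\right)^{\tau_2+1}\frac{2^{\nu}\,\Gamma(\nu+\frac12)}{\sqrt{\pi}\,\Gamma(\tau_1+1)\Gamma(\tau_2+1)}\,e^{-\rho Y'}\,Y'^{\,\nu}\,\mathfrak{K}_{\tau_1,\tau_2}(Y'), \tag{xxxviii}$$

or by (xxxvii), 

$$z = M\,\tfrac12(\gamma_1+\gamma_2)\,(1+\rho)^{\tau_2+1}(1-\rho)^{\tau_1+1}\,\frac{\Gamma^2(\nu+\frac12)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)}\,e^{-\rho Y'}\,T_{\tau_1,\tau_2}(Y'). \tag{xxxix}$$

The element of frequency is 

$$z\,dY = z\,dY'/\tfrac12(\gamma_1+\gamma_2).$$

Hence 

$$z\,dY = M(1+\rho)^{\tau_2+1}(1-\rho)^{\tau_1+1}\,\frac{\Gamma^2(\nu+\frac12)}{\Gamma(\tau_1+1)\Gamma(\tau_2+1)}\,e^{-\rho Y'}\,T_{\tau_1,\tau_2}(Y')\,dY'. \tag{xl}$$

Putting $\tau_1 = \tau_2 = \nu - \frac12$ we have 

$$z\,dY = M(1-\rho^2)^{\nu+\frac12}\,e^{-\rho Y'}\,T_\nu(Y')\,dY', \tag{xl bis}$$

* Watson, Theory of Bessel Functions, p. 79. 

which agrees with the value given in Biometrika, Vol. XXI. p. 183, Equation (xli), 
if we remember that we are here dealing with one-half the full curve, and that $\rho$ 
may be either positive or negative. Sometimes one and sometimes the other of the 
two expressions $\mathfrak{K}_{\tau_1,\tau_2}(Y')$ and $T_{\tau_1,\tau_2}(Y')$ may be the more serviceable. 

Writing (xxxvi) in the form 

$$\mathfrak{K}_{\tau_1+1,\tau_2+1}(x) = \frac{2\nu}{x}\left(1 + \frac{2(\tau_1-\tau_2)x}{4\nu^2-1}\right)\mathfrak{K}_{\tau_1,\tau_2}(x) + \left(1 - \left(\frac{\tau_1-\tau_2}{2\nu-1}\right)^2\right)\mathfrak{K}_{\tau_1-1,\tau_2-1}(x),$$

we may replace the $\mathfrak{K}_{\tau_1,\tau_2}$ by the $T_{\tau_1,\tau_2}$ functions and find 

$$T_{\tau_1+1,\tau_2+1}(x) = \frac{2\nu}{2\nu+1}\left(1 + \frac{2(\tau_1-\tau_2)x}{4\nu^2-1}\right)T_{\tau_1,\tau_2}(x) + \frac{x^2}{4\nu^2-1}\left(1 - \left(\frac{\tau_1-\tau_2}{2\nu-1}\right)^2\right)T_{\tau_1-1,\tau_2-1}(x). \tag{xli}$$

When we make $\tau_1 = \tau_2 = \nu - \frac12$ this becomes 

$$T_{\nu+1}(x) = \frac{2\nu}{2\nu+1}\,T_\nu(x) + \frac{x^2}{4\nu^2-1}\,T_{\nu-1}(x),$$

and is identical with the formula (xliv) on p. 184 of Biometrika, Vol. XXI. 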
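
The recurrence for the $T_{\tau_1,\tau_2}$ functions lends itself to a direct numerical check. The sketch below takes $T_{\tau_1,\tau_2}(x) = \dfrac{x^{2\nu}}{2^{2\nu}\Gamma^2(\nu+\frac12)}\int_1^\infty e^{-xt}(t-1)^{\tau_1}(t+1)^{\tau_2}dt$, which follows from (xxix) and (xxxvii), evaluates it by quadrature, and compares $T_{\tau_1+1,\tau_2+1}(x)$ with the combination of $T_{\tau_1,\tau_2}(x)$ and $T_{\tau_1-1,\tau_2-1}(x)$ obtained on dividing (xxxvi) through by the appropriate constants. The function names, the substitution $t = 1+s^2$ and the trial values are assumptions of this illustration; $2\tau_1$ is taken to be a whole number with $\tau_1 \geq 1$ so that every integrand is smooth.

```python
import math


def simpson(f, a, b, n=4000):
    """Composite Simpson rule on [a, b] using n (even) sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def T(x, t1, t2):
    """T_{t1,t2}(x) by quadrature, using t = 1 + s^2."""
    nu = 0.5 * (t1 + t2 + 1)
    integral = simpson(
        lambda s: (math.exp(-x * (1 + s * s)) * s ** (2 * t1 + 1)
                   * (2 + s * s) ** t2 * 2),
        0.0, math.sqrt(60.0 / x))
    return x ** (2 * nu) / (2 ** (2 * nu) * math.gamma(nu + 0.5) ** 2) * integral


def recurrence_rhs(x, t1, t2):
    """T_{t1+1,t2+1}(x) as given by the recurrence in T_{t1,t2} and T_{t1-1,t2-1}."""
    nu = 0.5 * (t1 + t2 + 1)
    d = (t1 - t2) / (2 * nu - 1)
    return (2 * nu / (2 * nu + 1)
            * (1 + 2 * (t1 - t2) * x / (4 * nu * nu - 1)) * T(x, t1, t2)
            + x * x / (4 * nu * nu - 1) * (1 - d * d) * T(x, t1 - 1, t2 - 1))


if __name__ == "__main__":
    print(T(1.0, 3, 2), recurrence_rhs(1.0, 2, 1))
    print(T(0.6, 2.5, 3.5), recurrence_rhs(0.6, 1.5, 2.5))
```

For whole-number indices the integral is elementary; for instance the integral in $T_{3,2}(1)$ equals $240\,e^{-1}$, which provides a check on the quadrature itself.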


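(vi) In the next place we may consider the differential coefficients of our 
functions 

$$\frac{d}{dx}\left(x^{\nu}\mathfrak{K}_{\tau_1,\tau_2}(x)\right) = 2\nu x^{\nu-1}\mathfrak{K}_{\tau_1,\tau_2}(x) - \frac{\sqrt{\pi}\,x^{2\nu}}{2^{\nu}\Gamma(\nu+\frac12)}\int_1^\infty e^{-xt}\,t\,(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt,$$

and thus by (xxxv), 

$$= 2\nu x^{\nu-1}\mathfrak{K}_{\tau_1,\tau_2}(x) - x^{\nu}\left(\frac{2\nu}{x}\,\mathfrak{K}_{\tau_1,\tau_2}(x) + \frac{\tau_1-\tau_2}{2\nu-1}\,\mathfrak{K}_{\tau_1,\tau_2}(x) + \left(1 - \left(\frac{\tau_1-\tau_2}{2\nu-1}\right)^2\right)\mathfrak{K}_{\tau_1-1,\tau_2-1}(x)\right)$$

$$= -x^{\nu}\,\mathfrak{K}_{\tau_1-1,\tau_2-1}(x)\left(1 - \left(\frac{\tau_1-\tau_2}{2\nu-1}\right)^2\right) - \frac{\tau_1-\tau_2}{2\nu-1}\,x^{\nu}\,\mathfrak{K}_{\tau_1,\tau_2}(x).$$

Thus 

$$\frac1x\frac{d}{dx}\left(x^{\nu}\mathfrak{K}_{\tau_1,\tau_2}(x)\right) = -x^{\nu-1}\left(1 - \left(\frac{\tau_1-\tau_2}{2\nu-1}\right)^2\right)\mathfrak{K}_{\tau_1-1,\tau_2-1}(x) - \frac{\tau_1-\tau_2}{2\nu-1}\,x^{\nu-1}\,\mathfrak{K}_{\tau_1,\tau_2}(x). \tag{xlii}$$

This reduces to the familiar relation 

$$\frac1x\frac{d}{dx}\left(x^{\nu}K_\nu(x)\right) = -x^{\nu-1}K_{\nu-1}(x)$$

of the Bessel K-function, when $\tau_1 = \tau_2 = \nu - \frac12$*. The corresponding result in terms 
of $T_{\tau_1,\tau_2}(x)$ is 

$$\frac{d}{dx}T_{\tau_1,\tau_2}(x) = -\frac{x}{2\nu-1}\left(1 - \left(\frac{\tau_1-\tau_2}{2\nu-1}\right)^2\right)T_{\tau_1-1,\tau_2-1}(x) - \frac{\tau_1-\tau_2}{2\nu-1}\,T_{\tau_1,\tau_2}(x). \tag{xliii}$$

If $\tau_1 = \tau_2$, then $\dfrac{d}{dx}\left(T_\nu(x)\right) = -\dfrac{x}{2\nu-1}\,T_{\nu-1}(x)$. 

* Watson, Theory of Bessel Functions, p. 79, Equation (5), with m = 1. 

We may put (xliii) by aid of (xli) in the form 

$$T_{\tau_1+1,\tau_2+1}(x) = \left\{\frac{2\nu}{2\nu+1} + \frac{(\tau_1-\tau_2)\,x}{(2\nu+1)^2}\right\}T_{\tau_1,\tau_2}(x) - \frac{x}{2\nu+1}\,\frac{dT_{\tau_1,\tau_2}(x)}{dx}, \tag{xliii bis}$$

whence if $\tau_1 = \tau_2$ the result 

$$T_{\nu+1}(x) = \frac{2\nu}{2\nu+1}\,T_\nu(x) - \frac{x}{2\nu+1}\,\frac{dT_\nu(x)}{dx}$$

of Biometrika, Vol. XXIV. p. 309, Equation (1) directly follows. 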

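(vii) These results enable us to determine the equation for the mode of the 
curve 

$$z = z_0'\,e^{-\rho Y'}\,T_{\tau_1,\tau_2}(Y'),$$

where $Y' = \frac12(\gamma_1+\gamma_2)Y$. 

We must put $dz/dY$ or $dz/dY' = 0$, whence we obtain, if $\hat{Y}'$ correspond to the 
mode, i.e. $= \frac12(\gamma_1+\gamma_2)\hat{Y}$, where $\hat{Y}$ is the variate, $v - u$, at mode, 

$$\rho - \left[\frac{1}{T_{\tau_1,\tau_2}(Y')}\,\frac{d}{dY'}T_{\tau_1,\tau_2}(Y')\right]_{Y'=\hat{Y}'} = 0,$$

or, by (xliii), since $\nu = \frac12(\tau_1+\tau_2+1)$, 

$$\rho = -\frac{\tau_1-\tau_2}{\tau_1+\tau_2} - \frac{\hat{Y}'}{\tau_1+\tau_2}\left\{1 - \left(\frac{\tau_1-\tau_2}{\tau_1+\tau_2}\right)^2\right\}\frac{T_{\tau_1-1,\tau_2-1}(\hat{Y}')}{T_{\tau_1,\tau_2}(\hat{Y}')}. \tag{xliv}$$

If we take $\tau_1 = \tau_2$, this agrees with a somewhat different notation ($\rho$ negative), with 
the result 

$$\rho = \frac{\hat{Y}'}{2\nu-1}\,\frac{T_{\nu-1}(\hat{Y}')}{T_\nu(\hat{Y}')}$$

of Biometrika, Vol. XXI. p. 184, Equation (xlv). 

The simplest form of the result is in the $\mathfrak{K}_{\tau_1,\tau_2}(x)$ functions, namely 

$$\rho = -\frac{\tau_1-\tau_2}{\tau_1+\tau_2} - \frac{4\tau_1\tau_2}{(\tau_1+\tau_2)^2}\,\frac{\mathfrak{K}_{\tau_1-1,\tau_2-1}(\hat{Y}')}{\mathfrak{K}_{\tau_1,\tau_2}(\hat{Y}')}. \tag{xliv bis}$$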


Clearly if a Table be formed of $\mathfrak{K}_{\tau_1,\tau_2}(x)$ or $T_{\tau_1,\tau_2}(x)$, we can for every entry of 
$\tau_1$, $\tau_2$ compute the right-hand side of (xliv) or (xliv bis), and then by backward 
interpolation obtain $\hat{Y}'$*. Since $\rho$ always lies between $+1$ and $-1$, $\gamma_1$ and $\gamma_2$ being 
(see Equation (i)) positive, we may have to find $\hat{Y}'$ either from (xliv bis), or from 
the equation 

$$\rho = -\frac{\tau_2-\tau_1}{\tau_2+\tau_1} - \frac{4\tau_1\tau_2}{(\tau_1+\tau_2)^2}\,\frac{\mathfrak{K}_{\tau_2-1,\tau_1-1}(\hat{Y}')}{\mathfrak{K}_{\tau_2,\tau_1}(\hat{Y}')}, \tag{xliv ter}$$

as the case may be. 

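* The auxiliary Table thus formed will correspond with the columns entitled "ρ" on pp. 195-201 
of Biometrika, Vol. XXI. 

(viii) Tables to be computed. Before it is possible to illustrate the statistical 
value of the Double Bessel Function, it is needful for tables of it to be computed. 
Such tables will be calculated for the arguments $\tau_1$ and $\tau_2$ proceeding by 0.5. In 
order to construct such tables by aid of the recurrence formulae of this paper we 
need to compute fourteen primary values of $T_{\tau_1,\tau_2}(x)$ or $\mathfrak{K}_{\tau_1,\tau_2}(x)$. The reader may 
be reminded of the following fundamental relations, in which $\nu = \frac12(\tau_1+\tau_2+1)$: 

$$T_{\tau_1,\tau_2}(x) = \frac{x^{2\nu}}{2^{2\nu}\,\Gamma^2(\frac12(\tau_1+\tau_2)+1)}\int_1^\infty e^{-xt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt,$$

$$T_{\tau_1,\tau_2}(x) = \frac{1}{2^{\nu}\sqrt{\pi}\,\Gamma(\frac12(\tau_1+\tau_2)+1)}\,x^{\nu}\,\mathfrak{K}_{\tau_1,\tau_2}(x), \quad\text{see Equation (xxxvii)},$$

$$\mathfrak{K}_{\tau_1,\tau_2}(x) = \frac{\sqrt{\pi}\,x^{\nu}}{2^{\nu}\,\Gamma(\frac12(\tau_1+\tau_2)+1)}\int_1^\infty e^{-xt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt, \quad\text{see Equation (xxix)}.$$

When $\tau_1 = \tau_2$, $\nu = \tau_1 + \frac12$ and $T_{\tau_1,\tau_1}(x)$ becomes $T_{\tau_1+\frac12}(x)$, the single T-function, where 

$$T_{\tau_1+\frac12}(x) = \frac{x^{\tau_1+\frac12}}{\sqrt{\pi}\,2^{\tau_1+\frac12}\,\Gamma(\tau_1+1)}\,K_{\tau_1+\frac12}(x),$$

and $K_{\tau_1+\frac12}(x)$ is the Single Bessel K-function. 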


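(1) $T_{-\frac12,-\frac12}(x)$.

$K_0(x)$ will be found tabled to 21 figures by Aldis in the R. Soc. Proc. Vol. LXIV.
pp. 219–221.


(2) $T_{-\frac12,0}(x)$.

$$T_{-\frac12,0}(x) = \frac{x^{\frac12}}{\sqrt2\,\Gamma^2(\tfrac34)} \int_1^\infty e^{-xt}(t-1)^{-\frac12}\,dt.$$

Take $z^2 = t-1$, and we have

$$T_{-\frac12,0}(x) = \frac{\sqrt2\,x^{\frac12}e^{-x}}{\Gamma^2(\tfrac34)} \int_0^\infty e^{-xz^2}\,dz,$$

or

$$T_{-\frac12,0}(x) = \sqrt{\frac{\pi}{2}}\,\frac{e^{-x}}{\Gamma^2(\tfrac34)}.$$

(3) $T_{0,-\frac12}(x)$.

Put $x(t+1) = \tfrac12u^2$ and we have

$$T_{0,-\frac12}(x) = \frac{\sqrt{2\pi}\,e^{x}}{\Gamma^2(\tfrac34)} \int_{2\sqrt{x}}^\infty \frac{e^{-\frac12u^2}}{\sqrt{2\pi}}\,du = \frac{\sqrt{2\pi}\,e^{x}}{\Gamma^2(\tfrac34)}\,\tfrac12\!\left(1-\alpha_{2\sqrt{x}}\right).$$

Here $\tfrac12(1-\alpha_{2\sqrt{x}})$ may be taken from the probability integral table.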



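(4) $T_{0,0}(x)$.

$$T_{0,0}(x) = T_{\frac12}(x) = \frac{1}{\sqrt{2\pi}}\,\sqrt{x}\,K_{\frac12}(x) = \tfrac12 e^{-x}.$$

(5) $T_{\frac12,0}(x)$.

$$T_{\frac12,0}(x) = \frac{x^{\frac32}}{2\sqrt2\,\Gamma^2(\tfrac54)} \int_1^\infty e^{-xt}(t-1)^{\frac12}\,dt = \frac{x^{\frac32}e^{-x}}{\sqrt2\,\Gamma^2(\tfrac54)} \int_0^\infty e^{-xz^2}z^2\,dz,$$

if $t - 1 = z^2$. Hence

$$T_{\frac12,0}(x) = \frac{2\sqrt{2\pi}\,e^{-x}}{\Gamma^2(\tfrac14)}.$$

(6) $T_{0,\frac12}(x)$.

$$T_{0,\frac12}(x) = \frac{x^{\frac32}}{2\sqrt2\,\Gamma^2(\tfrac54)} \int_1^\infty e^{-xt}(t+1)^{\frac12}\,dt,$$

or, taking $x(t+1) = \tfrac12u^2$, we have

$$T_{0,\frac12}(x) = \frac{4\sqrt{2\pi}\,e^{x}}{\Gamma^2(\tfrac14)} \int_{2\sqrt{x}}^\infty \frac{u^2e^{-\frac12u^2}}{\sqrt{2\pi}}\,du,$$

whence integrating by parts,

$$T_{0,\frac12}(x) = \frac{4\sqrt{2\pi}\,e^{x}}{\Gamma^2(\tfrac14)} \left\{\frac{2\sqrt{x}\,e^{-2x}}{\sqrt{2\pi}} + \tfrac12\!\left(1-\alpha_{2\sqrt{x}}\right)\right\}.$$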

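(7) We may note two formulae here which may be of service:

$$T_{0,\tau_2}(x) = \frac{x^{\tau_2+1}}{2^{\tau_2+1}\,\Gamma^2(\tfrac12\tau_2+1)} \int_1^\infty e^{-xt}(t+1)^{\tau_2}\,dt,$$

and putting $x(t+1) = u$, we have

$$T_{0,\tau_2}(x) = \frac{e^{x}}{2^{\tau_2+1}\,\Gamma^2(\tfrac12\tau_2+1)} \int_{2x}^\infty e^{-u}u^{\tau_2}\,du \qquad \text{(xlv)},$$

which throws back the computing of $T_{0,\tau_2}(x)$ on the Tables of the Incomplete
$\Gamma$-Function.


(8) Again

$$T_{\tau_1,0}(x) = \frac{x^{\tau_1+1}e^{-x}}{2^{\tau_1+1}\,\Gamma^2(\tfrac12\tau_1+1)} \int_1^\infty e^{-x(t-1)}(t-1)^{\tau_1}\,dt = \frac{e^{-x}}{2^{\tau_1+1}\,\Gamma^2(\tfrac12\tau_1+1)} \int_0^\infty e^{-u}u^{\tau_1}\,du = \frac{1}{2^{\tau_1+1}}\,\frac{\Gamma(\tau_1+1)}{\Gamma^2(\tfrac12\tau_1+1)}\,e^{-x} \qquad \text{(xlvi)}.$$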





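Thus $T_{\tau_1,0}(x)$ may be easily computed by aid of tables of the complete $\Gamma$-function
and Newman and Glaisher’s Tables of the Exponential. Putting in (xlvi) $\tau_1 = 0$,
we have

$$T_{0,0}(x) = \tfrac12 e^{-x},$$

as in (4).

Putting in (xlv) $\tau_2 = 0$, we have the same result. Again putting $\tau_1 = \tfrac12$, we find

$$T_{\frac12,0}(x) = \frac{\Gamma(\tfrac32)}{2\sqrt2\,\Gamma^2(\tfrac54)}\,e^{-x} = \frac{2\sqrt{2\pi}\,e^{-x}}{\Gamma^2(\tfrac14)},$$

which agrees with (5), and putting $\tau_1 = -\tfrac12$,

$$T_{-\frac12,0}(x) = \frac{1}{\sqrt2}\,\frac{\Gamma(\tfrac12)}{\Gamma^2(\tfrac34)}\,e^{-x} = \sqrt{\frac{\pi}{2}}\,\frac{e^{-x}}{\Gamma^2(\tfrac34)},$$

which agrees with (2).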


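(9) $T_{-\frac12,\frac12}(x)$.

$$T_{-\frac12,\frac12}(x) = \tfrac12x \int_1^\infty e^{-xt}(t-1)^{-\frac12}(t+1)^{\frac12}\,dt.$$

Now

$$K_0(x) = \int_1^\infty e^{-xt}(t-1)^{-\frac12}(t+1)^{-\frac12}\,dt,$$

or

$$e^{-x}K_0(x) = \int_1^\infty e^{-x(t+1)}(t-1)^{-\frac12}(t+1)^{-\frac12}\,dt.$$

Accordingly

$$\frac{d}{dx}\left(e^{-x}K_0(x)\right) = -\int_1^\infty e^{-x(t+1)}(t-1)^{-\frac12}(t+1)^{\frac12}\,dt,$$

and

$$T_{-\frac12,\frac12}(x) = -\tfrac12x\,e^{x}\,\frac{d}{dx}\left(e^{-x}K_0(x)\right).$$

But* $\dfrac{d}{dx}K_0(x) = -K_1(x)$, therefore

$$T_{-\frac12,\frac12}(x) = \tfrac12x\left(K_1(x)+K_0(x)\right).$$

(10) $T_{\frac12,-\frac12}(x)$.

$$T_{\frac12,-\frac12}(x) = \tfrac12x \int_1^\infty e^{-xt}(t-1)^{\frac12}(t+1)^{-\frac12}\,dt.$$

Now

$$K_0(x) = e^{-x}\int_1^\infty e^{-x(t-1)}(t-1)^{-\frac12}(t+1)^{-\frac12}\,dt,$$

or

$$\frac{d}{dx}\left(K_0(x)e^{x}\right) = -\int_1^\infty e^{-x(t-1)}(t-1)^{\frac12}(t+1)^{-\frac12}\,dt.$$

Thus

$$T_{\frac12,-\frac12}(x) = -\tfrac12x\,e^{-x}\,\frac{d}{dx}\left(K_0(x)e^{x}\right),$$

or, finally,

$$T_{\frac12,-\frac12}(x) = \tfrac12x\left(K_1(x)-K_0(x)\right).$$

To find $T_{-\frac12,\frac12}(x)$ and $T_{\frac12,-\frac12}(x)$ we may use the values of $K_0(x)$ and $K_1(x)$ provided
by Aldis in the R. Soc. Proc. Vol. LXIV. pp. 219–221.

* Watson, Theory of Bessel Functions, p. 79 (7).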




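(11) $T_{0,\tau_2}(x)$.

If $\tau_2$ be a positive integer

$$T_{0,\tau_2}(x) = \frac{x^{\tau_2+1}e^{x}}{2^{\tau_2+1}\,\Gamma^2(\tfrac12\tau_2+1)} \int_1^\infty e^{-x(t+1)}(t+1)^{\tau_2}\,dt = \frac{(-1)^{\tau_2}\,x^{\tau_2+1}e^{x}}{2^{\tau_2+1}\,\Gamma^2(\tfrac12\tau_2+1)}\,\frac{d^{\tau_2}}{dx^{\tau_2}}\!\left(\frac{e^{-2x}}{x}\right)$$

$$= \frac{1}{2^{\tau_2+1}}\,\frac{\Gamma(\tau_2+1)}{\Gamma^2(\tfrac12\tau_2+1)}\,e^{-x}\left(1 + \frac{2x}{1!} + \frac{(2x)^2}{2!} + \dots + \frac{(2x)^{\tau_2}}{\tau_2!}\right) \qquad \text{(xlvii)}.$$

If $\tau_2 = 1$,

$$T_{0,1}(x) = \frac{1}{\pi}\,e^{-x}(1+2x).$$

(12) $T_{1,0}(x)$.

We have at once from (xlvi)

$$T_{1,0}(x) = \frac{e^{-x}}{\pi}.$$

(13) $T_{1,1}(x)$.

$$T_{1,1}(x) = T_{1+\frac12}(x) = T_{\frac32}(x) = \tfrac14 e^{-x}(1+x).$$

See Biometrika, Vol. xxi. p. 183, ftn.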
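The closed forms in (4), (11), (12) and (13) can be checked numerically. The sketch below is not part of the memoir: it assumes that $T_{\tau_1,\tau_2}(x)$ is the integral $\int_1^\infty e^{-xt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt$ multiplied by $x^{\tau_1+\tau_2+1}/\{2^{\tau_1+\tau_2+1}\Gamma^2(\tfrac12(\tau_1+\tau_2)+1)\}$, which is the normalisation implied by the special cases of items (2), (5), (8) and (11), and it evaluates the integral by Simpson's rule. The function names `double_t` and `closed_forms` are illustrative only.

```python
import math


def double_t(tau1, tau2, x, steps=20000, cutoff=60.0):
    """Approximate T_{tau1,tau2}(x) by Simpson's rule (tau1, tau2 >= 0 assumed)."""
    n = tau1 + tau2
    const = x ** (n + 1) / (2 ** (n + 1) * math.gamma(0.5 * n + 1) ** 2)
    # Substitute s = t - 1 and truncate where exp(-x * s) is negligible.
    s_max = cutoff / x
    h = s_max / steps  # steps must be even for Simpson's rule

    def integrand(s):
        return math.exp(-x * (s + 1.0)) * s ** tau1 * (s + 2.0) ** tau2

    total = integrand(0.0) + integrand(s_max)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return const * total * h / 3.0


def closed_forms(x):
    """Closed forms quoted in items (4), (12), (11) and (13)."""
    e = math.exp(-x)
    return {
        (0, 0): 0.5 * e,
        (1, 0): e / math.pi,
        (0, 1): e * (1 + 2 * x) / math.pi,
        (1, 1): 0.25 * e * (1 + x),
    }


if __name__ == "__main__":
    x = 1.5
    for (t1, t2), value in closed_forms(x).items():
        print(t1, t2, value, double_t(t1, t2, x))
```

The quadrature is restricted to non-negative indices, since the integrand is singular at $t = 1$ when $\tau_1$ is negative and Simpson's rule would then need a change of variable.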

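(14) $T_{\frac12,1}(x)$.

We will prove first a recurrence formula of some service in tabulating Double
Bessel Functions.

$$T_{\tau_1,\tau_2+1}(x) = \frac{x^{\tau_1+\tau_2+2}}{2^{\tau_1+\tau_2+2}\,\Gamma^2\!\left(\tfrac12(\tau_1+\tau_2)+\tfrac32\right)} \int_1^\infty e^{-xt}(t-1)^{\tau_1}(t+1)^{\tau_2}(t-1+2)\,dt$$

$$= \frac{x^{\tau_1+\tau_2+2}}{2^{\tau_1+\tau_2+2}\,\Gamma^2\!\left(\tfrac12(\tau_1+\tau_2)+\tfrac32\right)} \left\{\int_1^\infty e^{-xt}(t-1)^{\tau_1+1}(t+1)^{\tau_2}\,dt + 2\int_1^\infty e^{-xt}(t-1)^{\tau_1}(t+1)^{\tau_2}\,dt\right\},$$

or

$$T_{\tau_1,\tau_2+1}(x) = T_{\tau_1+1,\tau_2}(x) + \frac{\Gamma^2\!\left(\tfrac12(\tau_1+\tau_2)+1\right)}{\Gamma^2\!\left(\tfrac12(\tau_1+\tau_2)+\tfrac32\right)}\,x\,T_{\tau_1,\tau_2}(x) \qquad \text{(xlviii)}.$$

Put $\tau_1 = \tfrac12$, $\tau_2 = 0$, and we have

$$T_{\frac12,1}(x) = T_{\frac32,0}(x) + \frac{\Gamma^2(\tfrac54)}{\Gamma^2(\tfrac74)}\,x\,T_{\frac12,0}(x).$$

But by (8)

$$T_{\frac32,0}(x) = \frac{\Gamma(\tfrac52)}{4\sqrt2\,\Gamma^2(\tfrac74)}\,e^{-x}, \qquad T_{\frac12,0}(x) = \frac{\Gamma(\tfrac32)}{2\sqrt2\,\Gamma^2(\tfrac54)}\,e^{-x}.$$

Accordingly

$$T_{\frac12,1}(x) = \frac{\Gamma(\tfrac32)}{2\sqrt2\,\Gamma^2(\tfrac74)}\,e^{-x}\left(\tfrac34 + x\right).$$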


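(15) $T_{1,\frac12}(x)$.

We may rearrange (xlviii) in the form

$$T_{\tau_1+1,\tau_2}(x) = T_{\tau_1,\tau_2+1}(x) - \frac{\Gamma^2\!\left(\tfrac12(\tau_1+\tau_2)+1\right)}{\Gamma^2\!\left(\tfrac12(\tau_1+\tau_2)+\tfrac32\right)}\,x\,T_{\tau_1,\tau_2}(x) \qquad \text{(xlix)}.$$

Put $\tau_1 = 0$, $\tau_2 = \tfrac12$, and we have

$$T_{1,\frac12}(x) = T_{0,\frac32}(x) - \frac{\Gamma^2(\tfrac54)}{\Gamma^2(\tfrac74)}\,x\,T_{0,\frac12}(x).$$

But by (7)

$$T_{0,\frac32}(x) = \frac{1}{4\sqrt2}\,\frac{e^{x}}{\Gamma^2(\tfrac74)} \int_{2x}^\infty e^{-u}u^{\frac32}\,du = \frac{1}{4\sqrt2}\,\frac{e^{x}}{\Gamma^2(\tfrac74)} \left(\int_0^\infty - \int_0^{2x}\right) e^{-u}u^{\frac32}\,du = \frac{1}{4\sqrt2}\,\frac{\Gamma(\tfrac52)}{\Gamma^2(\tfrac74)}\,e^{x}\left(1 - I'_{2x}(\tfrac52)\right),$$

where $I'$ is the incomplete $\Gamma$-function ratio*.

Similarly

$$T_{0,\frac12}(x) = \frac{1}{2\sqrt2}\,\frac{\Gamma(\tfrac32)}{\Gamma^2(\tfrac54)}\,e^{x}\left(1 - I'_{2x}(\tfrac32)\right).$$

Thus

$$T_{1,\frac12}(x) = \frac{\Gamma(\tfrac32)\,e^{x}}{2\sqrt2\,\Gamma^2(\tfrac74)} \left\{\tfrac34\left(1 - I'_{2x}(\tfrac52)\right) - x\left(1 - I'_{2x}(\tfrac32)\right)\right\}.$$

The values of $I'_{2x}(\tfrac52)$ and $I'_{2x}(\tfrac32)$ must be found from the Tables of the
Incomplete $\Gamma$-Function.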

We may, if we please, work from the probability integral of the normal
curve, for

<»> - a J/ - ^ 1)4 * 


Hence writing x{t + 1) = £w 2 we have 

,JW 2* Pd) Js-J* 2 V2 


-MV-/ 2 *PU)W 2V2 

and then integrating twice by parts we find 

'Ai(x) = gj|^ | Vxe~* + (* “ ®* $ 0 “ “s-/*) ‘ • 

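Having regard to the values of $T_{0,-\frac12}(x)$ in (3) and $T_{0,\frac12}(x)$ in (6) this may be
written

$$T_{1,\frac12}(x) = \frac34\,\frac{\Gamma^2(\tfrac54)}{\Gamma^2(\tfrac74)}\,T_{0,\frac12}(x) - \frac49\,x\,T_{0,-\frac12}(x),$$

so that $T_{1,\frac12}(x)$ can be found from $T_{0,\frac12}(x)$ and $T_{0,-\frac12}(x)$.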

* The symbol $I'$ is here used to distinguish the integral ratio
$\int_0^{x} e^{-u}u^{q-1}\,du \big/ \int_0^\infty e^{-u}u^{q-1}\,du = I'_x(q)$ from the quantity
$I(u,p) = \int_0^{u\sqrt{p+1}} e^{-v}v^{p}\,dv \big/ \int_0^\infty e^{-v}v^{p}\,dv$
actually tabled in the work Tables of the Incomplete $\Gamma$-Function, 1922.
To obtain $I'_x(q)$ from those tables we must take $u = x/\sqrt{q}$ and $p = q-1$.


It has not been my purpose in this paper to enter into the general mathematical
theory of Double Bessel Functions; that may no doubt be of interest to the pure
mathematician. My purpose here is solely to deal with one type of Double Bessel
Function, the $\mathcal{K}_{\tau_1,\tau_2}(x)$, or in the more suitable form for our purpose the $T_{\tau_1,\tau_2}(x)$
function. This function arises naturally in the consideration of some important
statistical problems, and my only purpose in this paper is to develop those
properties of $T_{\tau_1,\tau_2}(x)$ which are necessary, if we wish to compute tables by aid of
which it will be possible to give a rapid practical answer to the problems in
question.

In a second practical part of this paper I hope to provide the tables required
and indicate by illustrations the type of problems they are designed to solve.



MISCELLANEA.

(i) Adjustments for the Moments of J-shaped Curves.

By W. PALIN ELDERTON, C.B.E., F.I.A.

When statistics expressible by the exponential $y = y_0e^{-x/\sigma}$ are stated in groups for each equal
interval $h$, the groups are the areas

$$\int_0^{h} y_0e^{-x/\sigma}\,dx;\quad \int_h^{2h} y_0e^{-x/\sigma}\,dx;\quad \text{etc.};\quad \text{or}\quad y_0\sigma(1-e^{-h/\sigma});\quad y_0\sigma(1-e^{-h/\sigma})\,e^{-h/\sigma};\quad y_0\sigma(1-e^{-h/\sigma})\,e^{-2h/\sigma};\quad \text{etc.}$$

These terms may also be regarded as a geometrical progression*, the first term being
$y_0\sigma(1-e^{-h/\sigma})$ and the common ratio $e^{-h/\sigma}$. It follows
that if we treat the areas as a geometrical progression extending to infinity, calculate the moments
on this assumption and read the result as graduated terms of a geometrical progression, we shall
reach correctly graduated areas, and we can subsequently write down the equation to the curve
with little trouble.

Other points are however involved. Let us write the geometrical progression as $ka^x$ and put
$A = (1-a)^{-1}$, then the moments about its mean are

2nd moment   $A^2 - A$,

3rd    „     $2A^3 - 3A^2 + A$,

4th    „     $9A^4 - 18A^3 + 10A^2 - A$,

and if we work out $\beta_1$ and $\beta_2$ we get $4 + h^2/\mu_2$ and $9 + h^2/\mu_2$ respectively.

Using the exponential, the moments, etc. about the mean are: $\mu_2 = \sigma^2$, $\mu_4 = 9\sigma^4$, $\beta_1 = 4$,
$\beta_2 = 9$.

Hence when we calculate moments, assuming that the statistics form a geometrical progression,
whereas they are really areas from a curve, and seek to choose the type of curve from
Pearson’s criteria in his system, we shall reach a persistent error. For this purpose the $\beta_1$ and $\beta_2$
found from the statistics should be reduced by $h^2/\mu_2$.
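The moment expressions above can be checked by direct summation. The following is a minimal sketch, not taken from the note: it sums the progression $(1-a)a^x$ over $x = 0, 1, 2, \dots$ in working units ($h = 1$), compares the central moments with the polynomials in $A$, and confirms that $\beta_1$ and $\beta_2$ exceed 4 and 9 by $1/\mu_2$. The function names and the truncation length `terms` are illustrative assumptions.

```python
def geometric_betas(a, terms=2000):
    """Central moments and betas of the progression (1 - a) * a**x, x = 0, 1, 2, ..."""
    probs = [(1 - a) * a ** x for x in range(terms)]
    mean = sum(x * p for x, p in enumerate(probs))
    mu = {k: sum((x - mean) ** k * p for x, p in enumerate(probs)) for k in (2, 3, 4)}
    beta1 = mu[3] ** 2 / mu[2] ** 3
    beta2 = mu[4] / mu[2] ** 2
    return mu, beta1, beta2


def elderton_polynomials(a):
    """Moments about the mean written in terms of A = 1 / (1 - a)."""
    A = 1 / (1 - a)
    return {
        2: A ** 2 - A,
        3: 2 * A ** 3 - 3 * A ** 2 + A,
        4: 9 * A ** 4 - 18 * A ** 3 + 10 * A ** 2 - A,
    }


if __name__ == "__main__":
    mu, beta1, beta2 = geometric_betas(0.6)
    print(mu, elderton_polynomials(0.6))
    print(beta1 - 4, beta2 - 9, 1 / mu[2])
```

With a group width $h$ other than unity, $\mu_2$ in real units is $h^2$ times its value in working units, so the excess $1/\mu_2$ found here becomes $h^2/\mu_2$.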

This rule can be used as an approximation in all J-shaped curves and will be found to give
satisfactory results.

So far we have assumed that we know the start of the curve and that all the bases of the
areas are of equal size. If this does not apply we can, in the case of an exponential curve, fit the
curve, excluding the first (incomplete) term, and regard that term as related to an appropriate
base extrapolated from the graduation of the remainder. This is an arbitrary arrangement but
has practical advantages.

In other J-shaped curves in similar circumstances the first step would be to assume an
exponential, to find therefrom approximately the base of the first incomplete group, and then
assume that the area is concentrated at the middle point. This will generally give good results:
the assumption of the exponential overstates the base and the assumption of half-way assumes a
less rapidly falling curve than the J-shaped forms of Types I and III. There is therefore
a balance of error.

* “Geometrical progression” is used throughout to describe a discrete series and exponential curve
to describe a continuous one.




Turning to the statistical side, the example on p. 106 of Frequency Curves and Correlation
gives $\mu_2 = 2.046$, $\beta_1 = 4.629$, $\beta_2 = 9.502$. These figures come from the unadjusted moments, and
deducting .49 from the above values for $\beta_1$ and $\beta_2$ we reach 4.14 and 9.01. The theoretical values
when an exponential curve is to be used are 4 and 9.

If we apply the rule as an approximation in other J-shaped cases we find that in the example
on p. 109, where a twisted J-shaped curve is given, $\mu_2 = 4.266$, $\beta_1 = .761$, $\beta_2 = 2.646$, and the
adjustment leads to $\beta_1 = .527$ and $\beta_2 = 2.412$. Hence $5\beta_2 - 6\beta_1 - 9$ becomes $-.098$ instead of $-.368$.
The theoretical criterion would lead us to expect $5\beta_2 - 6\beta_1 - 9 = 0$.

These examples are not, of course, complete evidence, but they show that the suggestion may
lead to accurate results, and it has the merit of simplicity. The rule with regard to the adjustment
of the $\beta$’s by $h^2/\mu_2$ may be combined with the approximations given on p. 106 of Frequency
Curves and Correlation, where it is mentioned that the mean is overstated, when $\mu_3$ is positive,
by about $\dfrac{h^2}{12\sigma^2}\,\sigma$, and the second moment about the true mean (i.e. the mean as corrected by
$h^2/(12\sigma)$) is understated by about $\dfrac{h^2}{12\sigma^2}\,\sigma^2$. Since $\sigma$ is unknown, these quantities will be (as they
are small) $\dfrac{h^2}{12\mu_2'}\sqrt{\mu_2'}$ and $\dfrac{h^2}{12\mu_2'}\,\mu_2'$. In this form the dimensions are maintained, and we see that
the degree of approximation is measured by $h^2/(12\mu_2')$. If $h$ be taken as a unit and the moments
found in terms of $h$, i.e. in working units, the corrections are $1/(12\sqrt{\mu_2'})$ and $\tfrac{1}{12}$ as stated in the
work just referred to.


(ii) Note on Mr Palin Elderton’s Corrections to the Moments of J-curves.

By K. PEARSON.

Mr Elderton has deduced his empirical rule from the exponential curve and suggested that
it may give good results for all J-curves. Such curves include those of finite range, and those
having constants far removed from those of the exponential curve. The simplicity of the rule is
so intriguing, that I thought I must try it on one or two curves far removed from the exponential,
those curves having known $\beta$’s, and computable frequencies.

I propose also to compare the accuracy of the results with moment corrections by other
methods.

First Distribution. Consider the curve

$$y = y_0\left(\frac{6.808 + x}{3.192 - x}\right)^{0.5}.$$

This is a Type XII J-curve of limited range, for $x$ runs from $-6.808$ to $+3.192$, or the range is
10 units, with an asymptote at the end of the range. The curve is what Mr Elderton terms a
twisted J-shaped curve, i.e. it rises vertically at the $x = -6.808$ end. We may write the curve in
the form

$$y = y_0\left(\frac{x}{10 - x}\right)^{0.5},$$

and its constants are then

Mean = 7.5, $\sigma$ = 2.5, $\beta_1$ = 1, $\beta_2$ = 3.

We need not trouble about $y_0$.

Dividing the range 10 into 10 equal subranges, we find by the Tables of the Incomplete
Beta-Function the following frequencies for a total of 10,000:

Subrange    0–1   1–2   2–3   3–4   4–5   5–6   6–7   7–8    8–9    9–10
Frequency   138   267   368   468   577   705   870   1109   1540   3958



Taking moments about the centre of the group 705, I find for the crude moment coefficients

$\nu_1' = 1.9165$, $\nu_2' = 9.5977$, $\nu_3' = 25.6039$, $\nu_4' = 134.9077$,

leading, after transference to the crude mean, to

$\bar{x}' = 7.4165$, $\mu_2' = 5.9247$, $\mu_3' = -15.49947$, $\mu_4' = 109.669,528$.

To get $\mu_3'$ positive we change the direction of the axis of $x$ and measure from the other end
of our curve, thus

$\bar{x}'' = 2.5835$, $\mu_2'' = 5.9247$, $\mu_3'' = 15.49947$, $\mu_4'' = 109.669,528$.

We have now to follow the rule and subtract $\dfrac{h^2}{12\sigma^2}\,\sigma$ from $\bar{x}''$. But we do not know $\sigma$, and are
compelled to use $\sqrt{\mu_2''} = 2.434,071$. Thus $\dfrac{h^2}{12\sigma} = \dfrac{1}{29.208,852}$, whereas the true $\mu_2$ would give
.033,333 as against .034,230.

The mean by Elderton’s Rule is thus: 2.549,167.

The correction is thus not large enough; it would have been smaller still had we used the
true but supposed unknown $\sigma$.

The $\beta$’s from the unadjusted moments are

$\beta_1'' = 1.155,141$, $\beta_2'' = 3.124,304$.

Elderton’s Rule bids us subtract $h^2/\mu_2$. Here we can use the corrected $\mu_2$*. To ascertain this
we must go back to $\nu_2' = 9.5979$ and subtract from it the square of the distance of the corrected
mean from 5.5, i.e. $(7.4165 - 5.5)^2 = (1.9165)^2 = 3.672,972$, to this we are to add $\tfrac{1}{12}$. Thus

$\sigma^2 = 9.5979 - 3.672,972 + .083,333 = 6.008,261$,
or $\sigma = 2.451,175$.

Further, $h^2/\sigma^2 = .166,4375$. Accordingly we have

$\beta_1' = .988,7035$, $\beta_2' = 2.957,8665$.
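The arithmetic of this passage can be retraced from the ten group frequencies. The sketch below is an illustration only: it takes the frequencies 138, 267, ..., 3958 of the First Distribution with unit subrange, forms the crude moment coefficients about the centre of the group 705, transfers them to the mean, and applies the rule as described (add $h^2/12$ to the second moment, then deduct $h^2/\sigma^2$ from both $\beta$’s). The function names are assumptions, and the last decimals may differ slightly from the printed values, which carry the figure 9.5979 for $\nu_2'$ in the final step.

```python
FREQS = [138, 267, 368, 468, 577, 705, 870, 1109, 1540, 3958]


def crude_moments(freqs, origin_index):
    """First four moment coefficients about the centre of one group (h = 1)."""
    total = sum(freqs)
    return [
        sum(f * (i - origin_index) ** k for i, f in enumerate(freqs)) / total
        for k in (1, 2, 3, 4)
    ]


def central_moments(nu):
    """Transfer moment coefficients about an arbitrary origin to the mean."""
    n1, n2, n3, n4 = nu
    mu2 = n2 - n1 ** 2
    mu3 = n3 - 3 * n1 * n2 + 2 * n1 ** 3
    mu4 = n4 - 4 * n1 * n3 + 6 * n1 ** 2 * n2 - 3 * n1 ** 4
    return mu2, mu3, mu4


def elderton_betas(mu2, mu3, mu4, h=1.0):
    """Crude betas and the same reduced by h^2 / sigma^2 (sigma^2 = mu2 + h^2/12)."""
    beta1 = mu3 ** 2 / mu2 ** 3
    beta2 = mu4 / mu2 ** 2
    sigma2 = mu2 + h ** 2 / 12.0
    shift = h ** 2 / sigma2
    return beta1, beta2, beta1 - shift, beta2 - shift


if __name__ == "__main__":
    nu = crude_moments(FREQS, 5)  # group 705 is the sixth group, index 5
    mu2, mu3, mu4 = central_moments(nu)
    print(nu)
    print(mu2, mu3, mu4)
    print(elderton_betas(mu2, mu3, mu4))
```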

We can now collect these results and compare them with the values obtained by various 
methods. 


TABLE I. Results for Distribution, Curve I.

Character            True        Unadjusted   Elderton’s    Pearse’s      Martin’s
                     Values      Values       Adjustments   Adjustments   Adjustments

Mean                 2.500,000   2.583,500    2.549,167     2.499,970     2.529,440
Standard Deviation   2.500,000   2.434,071    2.451,175     2.499,079     2.472,198
$\beta_1$            1.000,000   1.155,141     .988,7035     .994,249     1.048,328
$\beta_2$            3.000,000   3.124,304    2.957,8665    2.987,057     3.033,454


We have shown how the values provided by Elderton’s Rules are obtained. Miss Pearse’s
adjustments are given in Biometrika, Vol. xx A. pp. 314–355, and in Tables for Statisticians and
Biometricians, Part II. pp. clxxxvii–ccvi, with the requisite Tables XXXVIII–XLI.

* Mr Elderton tells me he would have used the uncorrected $\mu_2$, but this is smaller than the
corrected $\mu_2$ and accordingly since the $\beta$’s are already overcorrected, it would have increased the
discrepancy.







I have to thank Mr E. S. Martin for the arithmetical work. In case any reader should care
to follow out the work, I give his chief results. Writing down the frequencies in reverse order,
we have

$n_1 = 3958$, $n_2 = 1540$, $n_3 = 1109$, $n_4 = 870$, $n_5 = 705$, $n_6 = 577$.

Turning to Table XXXVIII, he found for

$q = .4$, $n_6 = 511.90$; $q = .5$, $n_6 = 576.59$; $q = .6$, $n_6 = 621.01$.

Clearly $q$ must be taken as .5. Table XLI then gave the $K$’s without interpolation as

$K_1 = -2814.3025$, $K_2 = 2124.0472$, $K_3 = -1783.9437$, $K_4 = 1634.1329$.

Now the first frequency was excluded, and moments taken for the remainder about the start
of the range of second frequency $n_2$. Using the formulae, Tables for Statisticians, p. cxc, the
values

$\nu_1' = 1.499,970$, $\nu_2' = 8.495,305$, $\nu_3' = 47.041,306$, $\nu_4' = 299.250,473$

were found.

Transferred to the centre we obtain

$\mu_2 = 6.245,395$, $\mu_3 = 15.562,793$, $\mu_4 = 116.510,025$,

and hence $\sigma$ and the $\beta$’s.

It will be noticed that the whole of this moment calculating work must also be done when
using Elderton’s Rules. But the adjustments are briefer in the latter than looking up $q$ and the
$K$’s from Miss Pearse’s Tables. It will not as a rule be requisite to interpolate.

The last column of Table I gives the results of Mr Martin’s not yet published method. By this
method there are two unknowns to be previously found before computing the moments, namely
the degree of asymptoticity as determined by Miss Pearse’s $q$, and further the position of the
asymptote, lying in the first subrange, is assumed to be unknown. This double ignorance
renders the method less exact than Miss Pearse’s which supposes the position of the asymptote
known.

It is clear that both the Pearse* and the Martin methods give in this case more accurate
results than Elderton’s. It remains then for the judgment of the individual investigator to
determine whether the increased accuracy is or is not worth the somewhat increased labour.

Second Distribution. Our first curve was one with low $\beta$ values. I thought it desirable to take
an extreme case in the opposite direction, namely a J-shaped curve with high $\beta$ values. I sought a
curve with $\beta_1 = 10$ and $\beta_2 = 20$, roughly, which would have fairly easily determined constants, and
of which the areas could be found without double interpolation into the Tables of the Incomplete
Beta-Function. A Type VI J-curve was taken with equation

$$y = 100,000 \times 2.166,966\,\frac{(x-1)^{-0.5}}{x^{15.5}}.$$

This gives a total frequency of 100,000, with the following constants:

Mean (from $x = 1$) = .035,714, $\sigma$ = .053,342,
$\beta_1$ = 11.206,897, $\beta_2$ = 21.884,013.
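These constants can be recovered from the transformation mentioned in the text. The sketch below is not Pearson's working: it assumes that $z = 1 - 1/x$ follows a Beta(.5, 15) law, so that $x - 1 = z/(1-z)$ follows the corresponding Type VI (beta-prime) law, whose $k$-th moment about $x = 1$ is $\prod_{i=1}^{k}(\alpha+i-1)/(\beta-i)$. The function names are illustrative.

```python
import math


def beta_prime_raw_moment(alpha, beta, k):
    """k-th moment about zero of z / (1 - z) when z is Beta(alpha, beta), beta > k."""
    value = 1.0
    for i in range(1, k + 1):
        value *= (alpha + i - 1) / (beta - i)
    return value


def type_vi_constants(alpha=0.5, beta=15.0):
    """Mean (from x = 1), sigma, beta_1 and beta_2 of the assumed Type VI curve."""
    m1, m2, m3, m4 = (beta_prime_raw_moment(alpha, beta, k) for k in (1, 2, 3, 4))
    mu2 = m2 - m1 ** 2
    mu3 = m3 - 3 * m1 * m2 + 2 * m1 ** 3
    mu4 = m4 - 4 * m1 * m3 + 6 * m1 ** 2 * m2 - 3 * m1 ** 4
    return m1, math.sqrt(mu2), mu3 ** 2 / mu2 ** 3, mu4 / mu2 ** 2


def normalising_constant(alpha=0.5, beta=15.0):
    """1 / B(alpha, beta), the factor multiplying the total frequency."""
    return math.gamma(alpha + beta) / (math.gamma(alpha) * math.gamma(beta))


if __name__ == "__main__":
    print(type_vi_constants())
    print(normalising_constant())
```

Under this assumption the normalising factor comes out close to the 2.166,966 printed in the equation of the curve.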

By aid of the transformation $x = 1/(1-z)$, we can convert the curve into an incomplete $\beta$-function,
i.e. $B_z(.5, 15)$, and thus find the areas. But difficulties present themselves, if we are to have a
workable number of sub-frequencies. The curve is extremely leptokurtic, and if we confine ourselves
to the number of some 25 sub-frequencies, the subrange $h$ must be of magnitude about .04.
This causes about 72 % of the frequency to fall on the first subrange, and there is a long tail
wherein if we only proceed by units of frequency it is difficult to know where exactly to place

* The Pearse method would doubtless have given still closer results, if we had used, as we ought,
also abruptness coefficients, at the far end of the range, where the curve is perpendicular to the x-axis.



them. To add to the difficulties interpolation by usual methods into the $\beta$-function Tables at
the upper end of the curve is unsatisfactory. After certain modifications of the $\beta$-function have
been made*, and the first sub-frequency checked by actual expansion, the following system was
obtained:


x            Frequency      x            Frequency

1.00–1.04    71796          1.48–1.52    23
1.04–1.08    15022          1.52–1.56    15
1.08–1.12     6438          1.56–1.60    10
1.12–1.16     3108          1.60–1.64     6
1.16–1.20     1600          1.64–1.68     4
1.20–1.24      861          1.68–1.72     3
1.24–1.28      480          1.72–1.76     2
                                          A    B
1.28–1.32      275          1.76–1.80     1    1
1.32–1.36      161          1.80–1.84     1    1
1.36–1.40       97          1.84–1.88     0    0
1.40–1.44       59          1.88–1.92     1    0
1.44–1.48       37          1.92–1.96     0    1


A and B are two alternative schemes for the arrangement of the tail. But they provide so
little final change of significance when tried by the Pearse method, that it is clear that the three
final units would have to be scattered far more widely apart than appeared reasonable to get a
closer approach to the observed higher moment values. As scattering increases the $\beta$’s, it was of
no advantage to try the Scheme B, see Table II, with Elderton’s rules as they are already too
large. The reader must remember that with Pearse’s method we neglect in the first place the
first frequency and find moments about $x = 1.04$, but with Elderton’s method we find moments
about $x = 1$. I give the final results in Table II without supplying the intermediate links. I may
note that Pearse’s $q$ lies between .4 and .5. If we interpolate for it, we find $q = .478,766$. Our
last column in Table II is given to indicate whether it is worth the trouble of interpolating for
$q$ and the $K$’s.

TABLE II. Results for Distribution, Curve II.

                                                                   Pearse’s Adjustments
Character           True         Crude        Elderton’s     Scheme A     Scheme B     Scheme A
                    Values       Values       Adjustments    q = .5       q = .5       Interpolated q

Mean from x = 1       .035,714     .042,721     .040,061       .035,721     .035,741     .035,584
$\sigma$              .053,342     .049,934     .053,365       .053,241     .053,249     .053,296
$\beta_1$           11.206,897   13.874,989   13.313,147     10.809,219   10.832,196   10.780,903
$\beta_2$           21.884,013   24.332,380   23.770,538     20.373,483   20.465,015   20.036,753


Elderton’s Rule gives the best result for the standard deviation†, but the deductions are seen
to be insufficient in the case of the mean and the two $\beta$’s, when $h$ forms so considerable a part of
$\sigma$. We should need to go to a higher approximation. Pearse’s results indicate that we gain
nothing of real significance by interpolating for $q$, a process which of course increases the labour

* I have to thank Mr E. C. Fieller for aid in this matter.

† The rule would give a worse result for the standard deviation had he obtained a better result for
the mean.






of adjustment. Again, we may note with satisfaction that little difference is made by increasing
the scatter of the tail, or the problem of where best to concentrate the last tail unit is after all
not of great importance. Pearse’s adjustments for the third and fourth moments give too low
values for the $\beta$’s, but, taken as a whole, for these $\beta$’s as well as for the mean they yield more
accurate results than Elderton’s method. It is, of course, an application to a most extreme
case, and there is little doubt that Elderton’s method would give improved results were the
corrective terms taken to a higher approximation. We must not, however, forget that it is precisely
those distributions which give an overwhelming frequency in the first subrange (incomes,
house property, etc.) that cause the greatest trouble with our graduation. For such curves $h/\sigma$ is
not a very small quantity, and we cannot afford to retain only the lowest power of the quantity
appearing in the adjustment.


(iii) A General Expression for the Moments of Certain Symmetrical
Functions of Normal Samples.


By R. C. GEARY, M.Sc. 


In a paper entitled “The Moments of the Distribution of Normal Samples of Measures of
Departure from Normality,” R. A. Fisher* has developed a technique for the calculation of
moments of functions of the type $k_p/k_2^{p/2}$, where

$$k_2 = \frac{1}{n-1}\sum_{i=1}^{n} y_i^2, \qquad k_3 = \frac{n}{(n-1)(n-2)}\sum_{i=1}^{n} y_i^3, \qquad \text{etc.,} \qquad \text{(i)}$$

with

$$y_i = x_i - m_1 = \left(1 - \frac{1}{n}\right)x_i - {\sum_{j=1}^{n}}' \frac{x_j}{n} \qquad \text{(ii)},$$

and where the $x_i$ represent the measures of $n$ elements drawn at random from a normal universe.
Fisher has given the exact formulae for some of the lower moments of $k_3/k_2^{3/2}$ and $k_4/k_2^{2}$. The
object of this note is to show how these results may be derived by an alternative process and to
write down a general formula for these moments. In what follows, the moments of the functions
$m_p/m_2^{p/2}$ will be considered: the lower moments of the corresponding Fisher functions can easily
be calculated therefrom. Square brackets, [ ], indicate universal arithmetic mean values, about
a fixed origin, not about the sampling mean. In the present problem the universal mean and
standard deviation may be presumed zero and unity respectively, without loss of generality.

First, a general formula for the moments of $m_p$, calculated from samples of $n$ drawn from a
normal universe, will be calculated. The method is that which C. C. Craig† has used to show how
the semi-invariants of moments of samples drawn from any universe may be derived and by
which he has calculated the exact values of the first four moments of $m_3$ and $m_4$.

Craig’s method, as applied to the normal case, starts with the following identity in $t$:

$$[e^{tx}] \equiv e^{\frac12t^2} \qquad \text{(iii)},$$


* Proceedings of the Royal Society, A, 130 (1931), p. 16 seq.

† “An Application of Thiele’s Semi-Invariants to the Sampling Problem.” Metron, Vol. VII, N. 4
(1928), p. 3 seq.


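from which it follows, since the $x_i$ are independent, that

$$\left[e^{\sum_i t_iy_i}\right] \equiv \exp\left\{\tfrac12\left(\sum_{i=1}^{n} t_i^2 - \frac{1}{n}\Big(\sum_{i=1}^{n} t_i\Big)^2\right)\right\} \qquad \text{(iv)},$$

upon substitution for the $y_i$ as given in (ii).

It is immediately evident that, in the normal case, $[m_p^r]$ is zero if $p$ and $r$ are both odd
numbers. It will only be necessary to consider the case of $rp$ even.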
Now 

[,»/] = ^[(SV) r ]=-,r 8 — = l S •;(”) [V* ...(V). 

I »J w rL\ J J<>'Lsr i = r'-,! •••'•«! J » r Sri = ,' 'l I W L J 

The means in the last expression can now bo calculated by identifying the coefficients of 
ti pr t ...t e pr * on both sides of (iv) : 

1 


[y 1 . . . 3/Vj 1 _'=*( 1 \ (2y - 2 9 ) ! 
pr^\,..pr t \ 2«*« 0 \n) (?-«)! 


8 1 ! (jw t - 2*,) I (pv- 2#,) ! * 


.(vi), 


with 

Hence, from (v) and (vi), 
//A pr 


r 


\rp = 2 ^ 


M = 2 ^ 5 ( .f =r 


i b-.jpr, ! / _ iy- (2y -2 <) ! ^ 

,!...r c ! h~o \ »/ (?-«)! 


- 2*j) ! ...^ ! 2^) 1 

(vii). 

Fisher* has shown that in the normal case a simple relation subsists between the moments
$[k_p^r]$ and the $[k_p^r/k_2^{pr/2}]$. His method of proof, depending on the properties of differential operators,
is somewhat complicated. The following method indicates the genesis of the relationship.

By an orthogonal transformation of the original variables $x$ of which one of the transformed
variables is $X_n = \dfrac{1}{\sqrt{n}}\sum x_i$, followed by a transformation into generalised polar coordinates of the
remaining $n-1$ variables $X_1, \dots, X_{n-1}$, of the differential element

$$\frac{1}{(2\pi)^{n/2}}\,e^{-\frac12\sum x_i^2}\prod_{i=1}^{n} dx_i,$$

it is easy to show that the mean $m_1$, the second moment $m_2$ $\left(= \dfrac{1}{n}\rho^2\right.$, where $\rho$ is the radius of the
polar coordinates$\left.\right)$ and the $n-2$ polar angles are all independent. Now it is clear that the
functions $m_p/m_2^{p/2}$, upon transformation, are explicit functions of the polar angles only and are
therefore independent of $m_2$. Hence

$$[m_p^r] = \left[m_p^r/m_2^{pr/2}\right]\times\left[m_2^{pr/2}\right] \qquad \text{(viii)}.$$

From the known distribution of $m_2$†, the last factor is as follows, where $2q = pr$:

$$\left[m_2^{q}\right] = \frac{(n-1)(n+1)(n+3)\dots(n+2q-3)}{n^{q}} \qquad \text{(ix)}.$$

From (vii), (viii) and (ix) the required expression for $[m_p^r/m_2^{pr/2}]$ can be written down at once.
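Formula (ix) may be checked against the standard result that, for samples of $n$ from a normal universe of unit standard deviation, $n\,m_2$ is distributed as $\chi^2$ with $n-1$ degrees of freedom. The sketch below is an illustration under that assumption, not part of the note: it compares the product $(n-1)(n+1)\dots(n+2q-3)/n^q$ with the $\Gamma$-function form $(2/n)^q\,\Gamma(\tfrac12(n-1)+q)/\Gamma(\tfrac12(n-1))$ of the $q$-th moment of $\chi^2_{n-1}/n$. Function names are hypothetical.

```python
import math


def m2_moment_product(n, q):
    """(n - 1)(n + 1)...(n + 2q - 3) / n**q for a positive integer q, as in (ix)."""
    value = 1.0
    for j in range(q):
        value *= n - 1 + 2 * j
    return value / n ** q


def m2_moment_gamma(n, q):
    """q-th moment of chi-square_{n-1} / n written with Gamma functions."""
    half = 0.5 * (n - 1)
    return (2.0 / n) ** q * math.gamma(half + q) / math.gamma(half)


if __name__ == "__main__":
    for n in (5, 10, 25):
        for q in (1, 2, 3, 4):
            print(n, q, m2_moment_product(n, q), m2_moment_gamma(n, q))
```

The $\Gamma$-function form also covers half-integral $q$, which is the case that arises when $pr$ is even but $q = \tfrac12 pr$ is not a whole number; the product form applies only to integral $q$.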

The property that $m_2$ and $m_p/m_2^{p/2}$ are independent may also be used to demonstrate certain
simple relationships between two-dimensional semi-invariants of the type $S_{kl}(m_2, m_p)$ when $l$ is
kept constant and $k$ is allowed to vary. The probable existence of these relationships was
surmised by C. C. Craig‡ from the form of the lower semi-invariants. The $S_{kl}(m_2, m_p)$ are given
by the identity in $\alpha$, $\beta$:

$$\sum\sum S_{kl}\,\frac{\alpha^k\beta^l}{k!\,l!} \equiv \log\left[e^{\alpha m_2 + \beta m_p}\right] \qquad \text{(x)}.$$

* Op. cit. p. 27.

† “Student,” “The Probable Error of a Mean,” Biometrika, Vol. VI. (1908) p. 1 seq.

‡ Op. cit. p. 61.




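The first term may be written

$$\left[e^{\alpha m_2 + \beta m_p}\right] = \sum\sum_{k+l=0}^{\infty} M_{kl}\,\frac{\alpha^k\beta^l}{k!\,l!}$$

with

$$M_{kl} = \left[m_2^{k}m_p^{l}\right] = \left[m_2^{k+\frac12lp}\right]\times\left[m_p^{l}/m_2^{\frac12lp}\right],$$

since $m_2$ and $m_p/m_2^{p/2}$ are independent for normal samples. Hence, from (ix),

$$M_{kl} = \frac{n-1+lp}{2}\cdot\frac{n+1+lp}{2}\cdots\frac{n+2k-3+lp}{2}\left(\frac{2}{n}\right)^{k}\times\left[m_p^{l}\right],$$

from which it follows that, if $\alpha$ is so small that $2\alpha < n$,

$$\sum\sum_{k+l=0}^{\infty} M_{kl}\,\frac{\alpha^k\beta^l}{k!\,l!} = \sum_{l=0}^{\infty}\frac{\beta^l}{l!}\left[m_p^{l}\right]\left(1-\frac{2\alpha}{n}\right)^{-\frac{n-1+lp}{2}} = \exp\left\{-\frac{n-1}{2}\log\left(1-\frac{2\alpha}{n}\right) + \sum_{l=1}^{\infty} S_{0l}\,\frac{\beta^l}{l!}\left(1-\frac{2\alpha}{n}\right)^{-\frac12lp}\right\} \qquad \text{(xi)}.$$

Expanding the logarithmic and binomial terms in the exponent of the last expression and
comparing the coefficient of $\alpha^k\beta^l$ with the corresponding coefficients in the second side of (x), it
will be seen that

$$S_{k0} = \frac{n-1}{2}\,(k-1)!\left(\frac{2}{n}\right)^{k}$$

and

$$S_{kl} = S_{0l}\,\frac{lp\,(lp+2)(lp+4)\dots(lp+2k-2)}{n^{k}} \qquad \text{(xii)}.$$

These results are in agreement with those for $S_{kl}(m_2, m_p)$, $k, l = 0, 1, 2, 3, 4$ and $p = 3, 4$, from
which Craig surmised the general results given at (xii).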


(iv) A Statistical Study of the Daucus Carota L.

By WILLIAM DOWELL BATEN
(University of Michigan).

The object of this article is to compare two samples, each of one thousand, of the Wild Carrot
taken from the roadside in Michigan and Indiana. The sample from Michigan was taken near
Ann Arbor during the summer of 1930, while that from Indiana was taken near Terre Haute
during the summer of 1931.

The Daucus Carota, or Wild Carrot, is a weed which grows profusely over the north-eastern
and north-central parts of the United States. I have found it growing in Michigan, Indiana,
Kentucky, West Virginia, Virginia, District of Columbia, Maryland, Delaware, Pennsylvania,
New York and Ohio. This plant is from one foot to five or six feet in height and has its flowers
arranged in umbels. At the ends of the tall and rigid stems are enlargements or knobs from
which flower arms or rays grow. At the end of each ray is a composite flower made up of many
tiny white flowers of different sizes. The entire inflorescence containing flower arms with their
flowers is from one to four or five inches in diameter and as a whole resembles delicate white
lace. This resemblance is no doubt the source of the name Queen Anne’s Lace. Around the knobs,
at the ends of the stems, grow the rays which are in rows, there being more rays near the base.
Flower arms at the centre of the cluster are much shorter than the others and contain few



[Plate I. Daucus Carota L., with the bracts labelled. Biometrika, Vol. XXV, Parts I and II; Baten, A Statistical Study of the Daucus Carota L.]


flowers which are sometimes pink and purple. The accompanying Plate I shows a flower cluster
and also a cluster after the rays have turned inward and the seeds are maturing.

Beneath the bottom row of the flower arms are found green bracts which resemble sepals.
They are slender and are made up of pointed branches which vary in number from one to seven
or more. These branched leaves hug close to the lower rays while the cluster is young, but grow
downward after it reaches maturity.

The following presents the chief characteristics of the distributions of the number of bracts
from both samples and also those for the distribution of the number of flower arms. The significance
of the difference of the means of these distributions is determined, together with the linear correlation
between the number of bracts and the number of rays for the sample from Indiana.

1. Chief characteristics of the distributions of bracts.

The following tables give the frequency distributions of the number of bracts from the
samples from Michigan and Indiana.


Number of bracts    Michigan Frequencies    Indiana Frequencies

 4                     1                       0
 5                     7                       0
 6                     8                       0
 7                    41                       0
 8                   303                      98
 9                   224                     143
10                   140                     159
11                   127                     205
12                    93                     201
13                    52                     189
14                     1                       3
15                     2                       2
16                     1                       0

Totals              1000                    1000



                      Michigan          Indiana

Mean                  9.463 bracts      10.857 bracts
Mode                  8     „           11     „
Median                9.123 „           10.985 „
Range                 13    „           8      „
Standard deviation    1.71016 „         1.61510 „
Skewness              0.560702          -0.215047

Significance of means = (Difference of Means) / (Probable Error of Difference of Means) = 27.88
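The means, standard deviations and the significance ratio can be recomputed from the bracts frequency table. The sketch below is a reconstruction under stated assumptions rather than the author's own working: standard deviations use the divisor $n$, the probable error of a mean is taken as $0.6745\,\sigma/\sqrt{n}$, and the probable error of the difference is the root of the sum of squares of the two. Names such as `significance` are illustrative.

```python
import math

BRACTS = list(range(4, 17))  # number of bracts, 4 to 16
MICHIGAN = [1, 7, 8, 41, 303, 224, 140, 127, 93, 52, 1, 2, 1]
INDIANA = [0, 0, 0, 0, 98, 143, 159, 205, 201, 189, 3, 2, 0]


def mean_sd(values, freqs):
    """Sample size, mean and standard deviation (divisor n) of a frequency table."""
    n = sum(freqs)
    mean = sum(v * f for v, f in zip(values, freqs)) / n
    var = sum(f * (v - mean) ** 2 for v, f in zip(values, freqs)) / n
    return n, mean, math.sqrt(var)


def significance(values, f1, f2, pe_factor=0.6745):
    """Difference of means divided by the probable error of that difference."""
    n1, m1, s1 = mean_sd(values, f1)
    n2, m2, s2 = mean_sd(values, f2)
    pe_diff = pe_factor * math.sqrt(s1 ** 2 / n1 + s2 ** 2 / n2)
    return abs(m1 - m2) / pe_diff


if __name__ == "__main__":
    print(mean_sd(BRACTS, MICHIGAN))
    print(mean_sd(BRACTS, INDIANA))
    print(significance(BRACTS, MICHIGAN, INDIANA))
```

Under these assumptions the ratio comes out a little below 28, in line with the statement in the text that the difference is nearly 28 times its probable error.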

The two distributions differ in several ways. The range for the sample from Michigan is
13 bracts, while the range for the Indiana sample is eight. There were no flower clusters on the
Indiana plants which had less than eight bracts, while there were 57 plants from Michigan
which had less than eight bracts per cluster. Most of the clusters from Michigan had nine or
less bracts per cluster, while most of those from Indiana had 11 or more bracts; yet there was
one cluster from Michigan which had 16 bracts while the highest number from the other sample
was 15 bracts. 72.4 per cent. from Michigan had 10 or less bracts per cluster, while 75.9 per cent.
from Indiana had 10 or more bracts per cluster. 27.6 per cent. from Michigan had 11 or more
while 60 per cent. from Indiana had 11 or more. 56 clusters from Michigan had 13 or more
bracts per cluster, while 194 from Indiana had 13 or more bracts per cluster.

A very good idea of how these two distributions differ is also manifested by skewness; that
for the Michigan sample was plus .560702, while that for Indiana was minus .215047. One distribution
is skew to the right and the other is skew to the left.

Histograms in Diagram I help the eye to distinguish these differences to some extent.

Diagram I.

The significance of the means shows that the two samples were not the result of random
sampling. The probability that one mean would differ from the other so much suggests that it
is almost impossible for these samples to be taken from the same parent population at random.
This significance is nearly 28 times the probable error of the difference of the means, which shows
clearly that the samples are not consistent with each other.

2. Characteristics of the distributions of the flower arms. 

The following table gives the frequencies relating to the flower arms in the clusters of the 
flowers of the Wild Carrot. 



Number of   Michigan      Indiana        Number of   Michigan      Indiana
rays        Frequencies   Frequencies    rays        Frequencies   Frequencies

   13            1             0             53           14            36
   18            1             0             54           18            58
   20            1             0             55           19            26
   22            3             0             56           17            50
   23            9             0             57            5            37
   24            7             0             58            4            31
   25            9             0             59            9            20
   26           11             0             60            4            41
   27           19             0             61            1            21
   28           24             1             62            2            28
   29           22             0             63            5            12
   30           36             4             64            0            17
   31           28             0             65            3             8
   32           41             4             66            1            22
   33           32             4             67            0            13
   34           30             7             68            1            17
   35           40             3             69            2            14
   36           45             5             70            0             9
   37           33             5             71            0             5
   38           48            22             72            0             8
   39           35             9             73            0             4
   40           57            28             74            0             7
   41           35            13             75            0             2
   42           40            34             76            1             6
   43           30            23             77            1             2
   44           34            35             78            0             2
   45           34            17             79            0             1
   46           34            36             80            0             4
   47           23            22             81            0             2
   48           26            53             82            0             5
   49           21            24             83            0             2
   50           33            44             84            0             2
   51           26            44             89            0             1
   52           19            49            105            0             1

                                           Totals       1000          1000


                           Michigan            Indiana

Mean                       40.512 f. arms      53.509 f. arms
Standard deviation         9.1880 „            10.1214 „
Skewness                   .3795               .5142
Probable error of mean     .1933 f. arms       .2162 f. arms

Significance = 44.71


The means for the above distributions show clearly that the plants from Indiana have larger
flower clusters on the average, a difference which could not be detected by the casual observer
who sees the plants along the roadside. In appearance the two flower clusters seem to be alike.
There were no clusters from the Indiana sample that had 27 or less flower arms, while there were
58 from the Michigan sample. There were no clusters from Michigan which had more than 77
flower arms, while there were 20 from Indiana which had more than 77. There were only two
clusters from Michigan with more than 69 rays, while there were 63 clusters from Indiana
with more than 69 rays. Two-thirds of the distribution from Michigan contained less than
45 rays, while eight-tenths of that from Indiana contained 45 or more rays. Only 56 clusters
from Michigan had more than 55 rays, while more than one-third of the sample from Indiana had
more than 55 rays. More than nine-tenths of the distribution from Michigan lies below the mean
of the Indiana distribution. More than nine-tenths of the distribution from Indiana lies above
the mean for the Michigan distribution.

Diagram II. FREQUENCY of FLOWER ARMS. WILD CARROT. (Horizontal axis: number of flower arms.)

Histograms in Diagram II show clearly how the distributions differ, as to range, means, and
the nature of the distributions at the ends. The large frequencies of the distribution from
Michigan correspond in a measure to the small frequencies for that from Indiana and vice versa.
The two distributions nearly coincide at 44 and 47. The facts that have been stated show clearly
that the Indiana plants produced larger flower clusters than the plants from Michigan.

The significance of the means is 44.71, which certainly shows that the two samples were not
due to random sampling from the same parent population. Just why these samples differ so
widely I cannot say.
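The "significance" quoted here is the difference of the two means divided by the probable error of that difference. The following is a minimal sketch of that arithmetic, assuming the two samples are independent so that the probable errors combine in quadrature (the function name is illustrative, not the author's). With the figures printed above it gives about 44.8, close to the 44.71 reported; the small gap presumably comes from rounding in the printed probable errors.

```python
import math

# Figures quoted in the text for flower arms per cluster (1000 clusters each).
mean_michigan, pe_michigan = 40.512, 0.1933
mean_indiana, pe_indiana = 53.509, 0.2162


def significance(mean_a, pe_a, mean_b, pe_b):
    """Difference of means divided by the probable error of that difference.

    Assumes independent samples, so the probable errors add in quadrature.
    """
    pe_difference = math.sqrt(pe_a ** 2 + pe_b ** 2)
    return abs(mean_a - mean_b) / pe_difference


ratio = significance(mean_michigan, pe_michigan, mean_indiana, pe_indiana)
print(f"difference / probable error = {ratio:.1f}")
```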

The Table on p. 189 shows that the frequencies for even numbers of flower arms are greater
than those for odd numbers. This is true for the clusters from Michigan and Indiana. The
fact is well exhibited on examining the histograms on page 190. For the Indiana sample the
frequencies for the numbers in the first line are found in the second line.


37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62
 5, 22,  9, 28, 13, 34, 23, 35, 17, 36, 22, 53, 24, 44, 44, 49, 36, 58, 26, 50, 37, 31, 20, 41, 21, 28


This rise at the even numbers and fall at the odd numbers can be easily seen by examining the
frequency polygons of original data. Histograms in Diagram II show this very clearly. In the
large majority of the cases the frequencies for the even numbers are larger than the frequencies
for the odd. The sum of the frequencies for the even numbers is 610, while the sum of the
frequencies for the odd numbers is 390.

Just why there are larger frequencies for the even numbers of rays I cannot say. The fact that
this plant is a dicotyledon may have something to do with it. Botanists perhaps can explain this
phenomenon.

Diagram III. FREQUENCY of FLOWER ARMS in GROUPS of FIVES. WILD CARROT.


Dividing the distribution into groups of twos does not remove many of the irregularities
caused by the predominance of the plants with even numbers of flower arms. Grouping by
threes eliminates almost all of the irregularities, yet gives distributions with two modes.

In Diagram III the two series of data are divided into groups of fives. In each case a Type III
curve has been fitted to the data. The following table presents the data in groups of fives
together with the computed frequencies from the Type III curves.

Observed and computed Frequencies of Rays in Clusters of the Wild Carrot.

             Frequency for Michigan sample    Frequency for Indiana sample
Classes      Observed       Computed          Observed       Computed

 10—14           1              .3                0              —
 15—19           1             3                  0              —
 20—24          20            24                  0              —
 25—29          85            78                  1              1
 30—34         173           160                 19             12
 35—39         201           215                 44             53
 40—44         196           204                133            123
 45—49         138           154                152            186
 50—54         110            89                230            198
 55—59          54            45                164            168
 60—64          12            18                119            118
 65—69           7             6                 74             72
 70—74           0             2                 34             38
 75—79           2             1                 13             18
 80—84           0             —                 15              8
 85—89           0             —                  1              2
105—109          0             —                  1              3*

Totals        1000           999.3             1000           1000

* Greater than 89.


The histograms for the distributions in groups of fives together with the corresponding Type III
curves again show that the plants from Indiana produce flower clusters with a larger number of
rays.

The Type III curve fits the Michigan sample better than a Type III curve does the Indiana
sample. The tall column near the mean no doubt causes this poor fit.

3. Correlation coefficient between bracts and rays. 

While examining the clusters the question arose as to whether clusters with a large number
of rays also had a large number of bracts. While counting the rays and bracts, clusters were
found which contained 13 bracts and 46 rays, also others were found with 13 bracts and 82 rays;
also some with eight bracts and 30 rays and others with eight bracts and 69 rays. The correlation
coefficient was considered to be the answer to this question. The Pearson linear correlation
coefficient between the number of bracts and the number of flower arms for the sample from
Indiana is

r = .624,

r = .630, if Sheppard's corrections be used for the standard deviations.





Correlation Table. Flower Arms and Bracts.
Number of flower arms per cluster, sample from Indiana



S'* 

1 

% 

& 

? 

a 

A 

I 

J 

3 

i 

<0 

s 

i 

<0 

nr 

i 

8 

i 

I 

$ 

1 

58 

105—109 

Totals 

15 

U 

IS 

IS 

11 

10 

9 

8 

1 

1 

2 

3 

4 

9 

1 

3 

13 

14 
13 

2 

5 

19 

31 

39 

37 

8 

21 

29 

29 

42 

23 

30 

41 

66 

49 

32 

12 

37 

48 

48 

22 

7 

2 

2 

38 

40 

28 

10 

1 

0 

37 

25 

8 

1 

2 

l 

1 

20 

8 

2 

1 

2 

1 

3 

1 

8 

6 

I 

1 

2 

3 

189 

201 

205 

159 

143 

98 

Totals 

1 

19 

44 

133 

152 

230 

164 

119 

74 

34 

13 

15 

1 

1 

1000 


This coefficient does show that there is a rather definite relation between the number of bracts
and the number of flower arms per cluster. This means that on the average clusters with a small
number of bracts will also have a small number of flower arms, and those with a large number of
bracts will on the average have a large number of flower arms. On examining the data it was
soon seen that there were only three clusters with eight bracts which had more than 54 rays.
There were two which had 15 bracts and these had 70 or more rays. Most of the plants with
eight bracts contained 44 or less rays per cluster, while most of those with 13 bracts contained
60 or more rays. The above table shows how the clusters were distributed for the number of bracts.

The following equation, obtained by the method of least squares, gives the regression straight
line of number of flower arms on bracts:

y = 10.506 + 3.961x*,

where y represents the mean number of flower arms in the curve and x represents the number of
bracts per cluster. This line is plotted in Diagram IV.

The following table gives the average number of flower arms per cluster for the respective 
number of bracts per cluster. 


Number of bracts     Average number of rays

       8                    43.22
       9                    46.59
      10                    49.08
      11                    53.27
      12                    58.62
      13                    62.19
      14                    67.00
      15                    77.00


These points have been plotted in Diagram IV and lie very close to the regression straight line.
The correlation of .624 is far from perfect correlation yet is much further from no correlation.
The above shows that the plants with a small number of bracts also have a small number of
flower arms and the plants with a large number of bracts have a large number of flower arms.
The above table shows that the average number of rays for all clusters which had eight bracts
was 43.22 rays, etc.
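The closeness of the group means to the regression line can be checked directly. The sketch below is only an illustration: the coefficients are taken as they read in the caption of Diagram IV (10.505 and 3.961; the digits are partly illegible in this copy, so they are an assumption), and the function and variable names are invented for the example.

```python
# Group means quoted in the table above (Indiana sample):
# number of bracts -> average number of rays.
group_means = {8: 43.22, 9: 46.59, 10: 49.08, 11: 53.27,
               12: 58.62, 13: 62.19, 14: 67.00, 15: 77.00}

# Coefficients as read from the caption of Diagram IV (assumed values).
INTERCEPT, SLOPE = 10.505, 3.961


def predicted_rays(bracts):
    """Mean number of flower arms given by the regression line."""
    return INTERCEPT + SLOPE * bracts


# Observed group mean minus the value on the line, for each number of bracts.
residuals = {b: obs - predicted_rays(b) for b, obs in group_means.items()}

for bracts, observed in group_means.items():
    print(f"{bracts:2d} bracts: observed {observed:6.2f}, "
          f"fitted {predicted_rays(bracts):6.2f}, "
          f"residual {residuals[bracts]:+6.2f}")
```

The groups with 14 and 15 bracts contain very few clusters, so larger residuals there would not be surprising.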

* For number of bracts on flower arms the regression line is x = 5.597 + .0908 y.



Diagram IV. REGRESSION LINE of FLOWER ARMS per CLUSTER on NUMBER of BRACTS. WILD CARROT.
Equation to Regression Line: y = 10.505 + 3.961x. (Horizontal axis: Number of Bracts, x.)



The above study was made to show that natural phenomena may be analyzed statistically to
a great advantage and that this type of study brings out many interesting details which might
be overlooked.

Seeds which were taken from the two environments have been planted under the same
conditions in the Botanical Gardens of the University of Michigan. Samples of one thousand will
again be compared. The experiment may extend over several years*.

The following table gives the number of flower arms per bract for the sample from Indiana,
for the plants with 8, 9, 10, 11, 12, 13, 14 and 15 bracts respectively.

                              Average number of
                              flower arms per bract

Plants with  8 bracts               5.15
   „     „   9    „                 5.18
   „     „  10    „                 4.91
   „     „  11    „                 4.84
   „     „  12    „                 4.87
   „     „  13    „                 4.78
   „     „  14    „                 4.79
   „     „  15    „                 5.13


* ... has studied this problem. Bulletin of Applied Botany, of Genetics and Plant-Breeding
(Russian Journal), Vol. xxvi. 1931, pp. 194—252.





This table shows that for the plants with certain numbers of bracts the number of flower
arms on the average is about five flower arms per bract. When all the flower arms and all the
bracts were considered the number of flower arms per bract was 4.93, which is on the average
about five.

When all of the flower arms and all the bracts were considered for the Michigan sample
the number of flower arms per bract was 4.28, which shows again that the two samples differ
widely.


(v) On a Property of the Mean Ranges in Samples from a Normal
Population and on some Integrals of Prof. T. Hojo.

By Prof. V. ROMANOVSKY, Tashkend.

I. 

Studying the paper of Prof. K. Pearson “On the Mean Character and Variance of a Ranked
Individual and on the Mean and Variance of the Intervals between Ranked Individuals” in
Biometrika, Vol. xxiii. pp. 394-397, I discovered a property of the mean ranges in samples
taken from a normal population which deserves some attention.

This property may be formulated as follows. 


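Let

$$\alpha_x = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{x} e^{-\frac{1}{2}x^2}\,dx \qquad (1);$$

then

$$w_m = 2m \int_{-\infty}^{+\infty} x\,\alpha_x^{\,m-1}\,d\alpha_x \qquad (2),$$

representing the mean range in samples of size m from an infinite normal population with zero as
mean and unity as variance, verifies identically the relation

$$\frac{w_{2m+1}}{4m+2} - C_m^1\,\frac{w_{2m}}{4m} + C_m^2\,\frac{w_{2m-1}}{4m-2} - \dots + (-1)^m\,\frac{w_{m+1}}{2m+2} = 0 \qquad \left(C_m^h = \frac{m!}{h!\,(m-h)!}\right) \qquad (3)$$

for m = 1, 2, 3, ....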

The demonstration of this identity is very simple. 

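Let us introduce the quantity

$$\lambda_m = \int_{-\infty}^{+\infty} x\,\alpha_x^{\,m}\,d\alpha_x = \frac{w_{m+1}}{2m+2}$$

and consider the integral

$$I_m = \int_{-\infty}^{+\infty} x\,\alpha_x^{\,m}\,(1-\alpha_x)^m\,d\alpha_x \qquad (4).$$

From (1) we obtain

$$\alpha_x = \tfrac{1}{2} + \frac{1}{\sqrt{2\pi}} \int_{0}^{x} e^{-\frac{1}{2}x^2}\,dx = \tfrac{1}{2} + u_x,$$

putting

$$u_x = \frac{1}{\sqrt{2\pi}} \int_{0}^{x} e^{-\frac{1}{2}x^2}\,dx,$$

and, therefore,

$$\alpha_x^{\,m}(1-\alpha_x)^m = (\tfrac{1}{2}+u_x)^m(\tfrac{1}{2}-u_x)^m = (\tfrac{1}{4}-u_x^2)^m$$

is an even function of x.

Now

$$I_m = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} x\,(\tfrac{1}{4}-u_x^2)^m\,e^{-\frac{1}{2}x^2}\,dx,$$

and it appears at once that $I_m$, as the integrand represents an uneven function, is identically
zero:

$$I_m = 0.$$

But, from (4),

$$I_m = \int x\,\alpha_x^{\,m}\,d\alpha_x - C_m^1 \int x\,\alpha_x^{\,m+1}\,d\alpha_x + \dots + (-1)^m \int x\,\alpha_x^{\,2m}\,d\alpha_x,$$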





and we see, remembering the definition of $\lambda_m$, that

$$\lambda_m - C_m^1\,\lambda_{m+1} + C_m^2\,\lambda_{m+2} - \dots + (-1)^m\,\lambda_{2m} = 0 \qquad (5),$$

or

$$\frac{w_{m+1}}{2m+2} - C_m^1\,\frac{w_{m+2}}{2m+4} + C_m^2\,\frac{w_{m+3}}{2m+6} - \dots + (-1)^m\,\frac{w_{2m+1}}{4m+2} = 0,$$

q.e.d., because this is only a slightly different form of (3).

We may remark that (5) may be written in the condensed form

$$\Delta^m \lambda_m = 0 \quad (m = 1, 2, 3, \dots) \qquad (5\ bis).$$

This relation can be applied to the verification of the tables of $w_m$ calculated by Mr Tippett.
Another application is this: for the calculation of $\lambda_m$ we need only to calculate directly
$\lambda_1, \lambda_3, \lambda_5, \dots$, for $\lambda_2, \lambda_4, \lambda_6, \dots$ we shall find from

$$\lambda_1 - \lambda_2 = 0, \qquad \lambda_2 - 2\lambda_3 + \lambda_4 = 0,$$

$$\lambda_3 - 3\lambda_4 + 3\lambda_5 - \lambda_6 = 0, \text{ etc.}$$
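Relation (5) lends itself to a quick numerical check. The sketch below is an illustration under stated assumptions, not the author's procedure: it takes λ_m to be the integral of x·α_x^m with respect to α_x (as the proof requires), evaluates it by Simpson's rule, and forms the alternating binomial sum. The names `lam` and `alternating_sum`, the integration limits and the step count are all choices made for this example.

```python
import math


def phi(x):
    """Ordinate of the normal curve with zero mean and unit variance."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def alpha(x):
    """Normal probability integral from -infinity to x."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def lam(m, half_width=10.0, steps=4000):
    """lambda_m = integral of x * alpha(x)**m * phi(x) dx, by Simpson's rule."""
    h = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        x = -half_width + i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        total += weight * x * alpha(x) ** m * phi(x)
    return total * h / 3.0


def alternating_sum(m):
    """lambda_m - C(m,1) lambda_{m+1} + ... + (-1)^m lambda_{2m}."""
    return sum((-1) ** h * math.comb(m, h) * lam(m + h) for h in range(m + 1))


for m in range(1, 5):
    mean_range = 2 * (m + 1) * lam(m)  # w_{m+1}, samples of size m + 1
    print(f"m = {m}: w_{m + 1} = {mean_range:.5f}, "
          f"left side of (5) = {alternating_sum(m):.1e}")
```

The left side of (5) should vanish to within the accuracy of the quadrature, and the printed values of w can be set beside the tabled mean ranges.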


II.


Having discovered the relation (5), I naturally tried to find similar relations for the integrals
T, I, H, S, K of Prof. T. Hojo (Biometrika, Vol. xxiii. pp. 325—326), which, as it can be assuredly
supposed, required much laborious calculation. Now it can be shown that this work could be
reduced to a half owing to some simple recurrent relations between Prof. T. Hojo's integrals.


The integrals in question are obviously of two types:

$$H_{m,p} = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} \alpha_x^{\,m}\,e^{-px^2}\,dx \qquad (6),$$

and

$$G_{m,p,q} = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} \alpha_x^{\,m}\,\alpha_{qx}\,e^{-px^2}\,dx \qquad (7),$$

where m = 1, 2, 3, ..., p > 0 and q are any real numbers. For simplicity we shall denote them as
$H_m$ and $G_m$, omitting other indices which will be supposed given and constant.
Let us take the first of these integrals.


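We may write

$$\alpha_x^{\,m+1}(1-\alpha_x)^m = (\tfrac{1}{2}+u_x)^{m+1}(\tfrac{1}{2}-u_x)^m = (\tfrac{1}{2}+u_x)(\tfrac{1}{4}-u_x^2)^m.$$

As $u_x$ is an uneven function of x, we must have

$$\int_{-\infty}^{+\infty} u_x\,(\tfrac{1}{4}-u_x^2)^m\,e^{-px^2}\,dx = 0,$$

and therefore

$$\frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} \alpha_x^{\,m+1}(1-\alpha_x)^m\,e^{-px^2}\,dx = \frac{1}{2}\cdot\frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} \alpha_x^{\,m}(1-\alpha_x)^m\,e^{-px^2}\,dx,$$

or, expanding both integrands by Newton's theorem and integrating term by term,

$$H_{m+1} - C_m^1 H_{m+2} + \dots + (-1)^m H_{2m+1} = \tfrac{1}{2}\left[H_m - C_m^1 H_{m+1} + \dots + (-1)^m H_{2m}\right] \qquad (8),$$

which may be written also as

$$\Delta^m H_{m+1} = \tfrac{1}{2}\,\Delta^m H_m \qquad (8\ bis).$$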


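This relation is well verified by the integrals T, I, S and K of Prof. T. Hojo, which shows the
exactness with which these integrals were calculated for Table I of his paper.

Quite similarly, starting from the identity

$$\alpha_x^{\,m}(1-\alpha_x)^m\,\alpha_{qx} = \tfrac{1}{2}(\tfrac{1}{4}-u_x^2)^m + u_{qx}(\tfrac{1}{4}-u_x^2)^m,$$

where $u_{qx} = \alpha_{qx} - \tfrac{1}{2}$, we find

$$G_m - C_m^1 G_{m+1} + \dots + (-1)^m G_{2m} = \tfrac{1}{2}\left[H_m - C_m^1 H_{m+1} + \dots + (-1)^m H_{2m}\right] \qquad (9),$$

or

$$\Delta^m G_m = \tfrac{1}{2}\,\Delta^m H_m \quad (m = 1, 2, 3, \dots) \qquad (9\ bis).$$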





The relation (8) shows that $H_1, H_2, H_4, H_6, \dots$ being known, we shall have $H_3, H_5, H_7, \dots$
from (8). And when all the H's are calculated, we need only to calculate directly $G_1, G_3, G_5, \dots$
in order to have $G_2, G_4, G_6, \dots$ from (9).

I shall conclude with two further relations which can be of use. Let

$$M_{m,n} = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} \alpha_x^{\,m}\,x^n\,e^{-px^2}\,dx,$$

m and n being any positive integers and p any positive real number. We easily find the relations

$$\Delta^m M_{m,\,2n+1} = 0 \qquad (10),$$

$$\Delta^m M_{m+1,\,2n} = \tfrac{1}{2}\,\Delta^m M_{m,\,2n} \qquad (11),$$

the differences being taken in respect to m.

Evidently many relations of the types considered in this note can be established which will
be useful in researches like those of Professors K. Pearson and T. Hojo.


(vi) Note on the Shrinkage of Physical Characters in Man and Woman
with Age, as an illustration of the use of χ², P Methods.

By PAMELA C. V. LESSER.

It is known that when large series of measurements are taken on adult men and women the
chief physical characters tend sensibly, if but slightly, to decrease with age, and further in Stature
and Brain-Weight this shrinkage appears to be greater in woman than man*. This shrinkage is
too slight to be adequately determined on short series, though even there it will be found to
exist. The present note is not intended either to measure the relative shrinkage in man and
woman, or to determine whether they are really due to the same causes. Its purpose is to
indicate on comparatively small samples how it would be possible on more numerous data to
answer the problem of whether shrinkage in physical characters can be attributed to the same
set of causes acting with the same intensity in man and woman.

Considering first only Brain-Weight and Stature, I have taken my data for the former from
measurements on Bavarians of the two sexes†, choosing the age of 20 years as clearing the primes
of both, and as the graphs of brain-weight indicate‡ the start of a slight but continuous shrinkage.
The data for stature were taken ultimately from Retzius' “Ueber das Hirngewicht der Schweden,”
Biologische Untersuchungen, N.F. Bd. ix (1900), but I have used the tabulation provided by
Pearl§. There was no occasion to adopt the same races for the two investigations, and it would not
have been feasible to do so as statures to ages were not given for the Bavarians in Pearl's paper.
Pearson's diagrams|| for stature and age indicate that shrinkage with age begins about 20 years
for both sexes.

A. We will consider in the first place Brain-Weights. 

Here for men the mean brain-weight of the younger group is 1364.06, and of the older group
1347.67, showing a shrinkage of 16.38 grs., which could not be demonstrated as significant on
these numbers.

* See K. Pearson, “On our present Knowledge of the Relationship of Mind and Body,” Annals of
Eugenics, Vol. i. (1926) pp. 387, 390.

† Bischoff's data from Das Hirngewicht des Menschen, 1880, as tabled by Raymond Pearl,
Biometrika, Vol. iv. p. 100.

‡ Pearson, loc. cit. p. 390.

§ Pearl, loc. cit. p. 88.

|| Loc. cit. p. 387.





TABLE I. Brain-Weights for Age Groups in Bavarians.

Men (weight in grams)

Age      1000-  1150-  1200-  1250-  1300-  1350-  1400-  1450-  1500-  1600-  Totals
Group    1149   1199   1249   1299   1349   1399   1449   1499   1599   1699

60—80       4      9     16     27     25     35     22     11      9      6     164
20—49       6      7     38     47     59     78     54     34     31     11     365

Women (weight in grams)

Age       800-   950-  1050-  1100-  1150-  1200-  1250-  1300-  1350-  1400-  Totals
Group     949   1049   1099   1149   1199   1249   1299   1349   1399   1599

60—80       3      6      8     19     11     14     14      3      6      1      85
20—49       0      2     14     22     45     54     55     23     13     10     238


For the women the younger group has a mean of 1244.88 and the older group one of 1189.21,
indicating a shrinkage of 55.67 grs., a larger amount than in the case of the men. Dealing first
with the men, we may suppose:

(i) That the parent population (from which we consider both samples to be drawn) has its
relative frequencies given by the sum of the columns. The corresponding value of χ²* is

$$\chi^2_{col.} = N N' \sum_s \frac{\left(\dfrac{n_s}{N} - \dfrac{n_s'}{N'}\right)^2}{n_s + n_s'},$$

where N and N' are the sizes of the two samples, and $n_s$, $n_s'$ the corresponding frequencies in
the s-th category. We find

$$\chi^2_{col.} = 8.8479.$$

As in our case we have taken 10 categories we have

$$P_{\chi^2_{col.}} = .452.$$


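(ii) That the parent population be that which gives the highest probability of the two
samples being drawn from the same population. We will write the corresponding χ²† as

$$\chi^2_{min.} = \frac{N N'}{N + N'} \left\{ \sum_s \left| \frac{n_s}{N} - \frac{n_s'}{N'} \right| \right\}^2,$$

and find

$$\chi^2_{min.} = 3.3451, \text{ and } P_{\chi^2_{min.}} = \dots$$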

Thus our data for men are inadequate and do not enable us to assert that brain-weight
shrinks with age in the males. They do, however, indicate that $\chi^2_{min.}$ and $\chi^2_{col.}$ may give widely
different measures of the probability of the two samples coming from the same parent population.
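The two measures can be recomputed from the men's rows of Table I. The sketch below is a hedged illustration: `chi2_col` uses the column-totals form, and `chi2_min` assumes the closed form N N'/(N + N') times the squared sum of absolute differences of the relative frequencies, which is inferred here because it reproduces the printed 3.3451; the function names are invented for the example.

```python
# Brain-weight frequencies for Bavarian men, copied from Table I.
older = [4, 9, 16, 27, 25, 35, 22, 11, 9, 6]        # ages 60-80, N = 164
younger = [6, 7, 38, 47, 59, 78, 54, 34, 31, 11]    # ages 20-49, N' = 365


def chi2_col(a, b):
    """chi-squared when the parent frequencies are taken from the column totals."""
    n_a, n_b = sum(a), sum(b)
    return n_a * n_b * sum(
        (x / n_a - y / n_b) ** 2 / (x + y) for x, y in zip(a, b) if x + y > 0
    )


def chi2_min(a, b):
    """chi-squared for the parent population that makes it a minimum.

    Assumed form: N N'/(N + N') * (sum of |n_s/N - n'_s/N'|) ** 2.
    """
    n_a, n_b = sum(a), sum(b)
    spread = sum(abs(x / n_a - y / n_b) for x, y in zip(a, b))
    return n_a * n_b / (n_a + n_b) * spread ** 2


print(f"chi2_col = {chi2_col(older, younger):.4f}")
print(f"chi2_min = {chi2_min(older, younger):.4f}")
```

Because the minimum is taken over all possible parent populations, `chi2_min` can never exceed `chi2_col` for the same pair of samples.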

Turning to the women, we find

$$\chi^2_{col.} = 36.4146, \text{ and } \chi^2_{min.} = 19.6177,$$

corresponding respectively to 

^Vooi. “ *9002, and 


* Biometrika, Vol. viii. p. 250.

† Biometrika, Vol. xxiv. p. 469.






Thus by the first method of $\chi^2_{col.}$ we should be prepared to accept the hypothesis that the old
and young women have not been drawn from the same population, i.e. that the shrinkage with
age is demonstrated. But by the second method which gives the parent population of maximum
probability we should be more doubtful of the truth of this hypothesis, and some might, with the
limit P > .02, be prepared to consider that they might well be drawn from the same parent
population. This example again illustrates how important it is that we should actually take into
account what is the parent population we are supposed to be dealing with.

We have now at our disposal four χ²'s, namely

             χ²_col.      χ²_min.

Men           8.8479       3.3451
Women        36.4146      19.6177

and we will denote the female values by adding a dash.

We next ask if the $\chi^2_{col.}$'s or the $\chi^2_{min.}$'s will the better enable us to determine whether the like
causes are at work in the cases of men and women. We may do this in two ways*, either by
considering the improbability of a ratio greater than $\chi'^2/\chi^2$ or a difference greater than $\tfrac{1}{2}(\chi'^2 \sim \chi^2)$.
Our results give

$$\chi'^2_{col.}/\chi^2_{col.} = \frac{36.4146}{8.8479} = 4.1156,$$

$$\chi'^2_{min.}/\chi^2_{min.} = \frac{19.6177}{3.3451} = 5.8646,$$

and

$$\tfrac{1}{2}(\chi'^2_{col.} \sim \chi^2_{col.}) = 13.7833,$$

$$\tfrac{1}{2}(\chi'^2_{min.} \sim \chi^2_{min.}) = 8.1363.$$

From Table II of Biometrika† we deduce

$$P(\chi'^2_{col.}/\chi^2_{col.}) = 2I_{.19548}(4.5,\ 4.5) = .0469,$$

$$P(\chi'^2_{min.}/\chi^2_{min.}) = 2I_{.14567}(4.5,\ 4.5) = .0147.$$

From these results we should conclude that on the basis of choosing for both men and women
the most probable parent populations, it is extremely likely that the shrinkage of brain in women
is not due to the same causes as in the case of men. But if we took the parent population to
have relative frequencies determined by the columnar marginal totals, this conclusion would be
very doubtful. Considering that our data actually do show a significant difference between the
brain-weights of young and old women, and do not show the like for men, the former conclusion
certainly appears the more reasonable.
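The ratio-test probabilities above have the form 2 I_x(v, v), where the argument x equals 1/(1 + ratio) and v is half the number of categories less one (4.5 for 10 categories), as the printed subscripts .19548 and .14567 indicate. The sketch below evaluates the incomplete beta function by Simpson's rule rather than from the published table, so small differences in the last figure are to be expected; the function names and the step count are assumptions made for this illustration.

```python
import math


def incomplete_beta(x, a, b, steps=20000):
    """Regularised incomplete beta function I_x(a, b) by Simpson's rule.

    Adequate for a, b > 1, where the integrand is bounded on [0, x].
    """
    h = x / steps
    total = 0.0
    for i in range(steps + 1):
        t = i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        total += weight * t ** (a - 1) * (1.0 - t) ** (b - 1)
    complete = math.gamma(a) * math.gamma(b) / math.gamma(a + b)
    return total * h / 3.0 / complete


def ratio_test(chi2_large, chi2_small, categories):
    """Return x = 1/(1 + ratio) and P = 2 I_x(v, v), with v = (categories - 1)/2."""
    x = 1.0 / (1.0 + chi2_large / chi2_small)
    v = (categories - 1) / 2.0
    return x, 2.0 * incomplete_beta(x, v, v)


for label, women, men in (("col.", 36.4146, 8.8479), ("min.", 19.6177, 3.3451)):
    x, p = ratio_test(women, men, categories=10)
    print(f"{label}: x = {x:.5f}, P = {p:.4f}")
```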

Turning now to the difference method we find, by Table I in Biometrika‡,

$$P(\tfrac{1}{2}\chi'^2_{col.} - \tfrac{1}{2}\chi^2_{col.}) = 2(.5 - .499,915) = .0002,$$

$$P(\tfrac{1}{2}\chi'^2_{min.} - \tfrac{1}{2}\chi^2_{min.}) = 2(.5 - .493,968) = .0121.$$

Probably (since P < .02) we should conclude from both these results that a real difference
exists between the cases of men and women. But the discordance of
$P(\tfrac{1}{2}\chi'^2_{col.} - \tfrac{1}{2}\chi^2_{col.}) = .0002$ and $P(\chi'^2_{col.}/\chi^2_{col.}) = .0469$, and the accordance of
$P(\tfrac{1}{2}\chi'^2_{min.} - \tfrac{1}{2}\chi^2_{min.}) = .0121$ with $P(\chi'^2_{min.}/\chi^2_{min.}) = .0147$,
certainly suggest that it is better to use the most likely parent population than that obtained
from the samples combined.

B. We next take the case of Stature. Here, as in the case of brain-weights, adequate data
indicate a definite if small shrinkage with age§. The mean stature of the men from 60 to
80 years is 168.831, and of the men from 20 to 50 169.504, indicating a shrinkage of 0.67 cms.


* Biometrika, Vol. xxiv. pp. 304—330.

† Loc. cit. p. 347.

‡ Loc. cit. p. 344.

§ See diagrams, Annals of Eugenics, Vol. ii. p. 100, and Vol. iii. p. 291, as confirming those of the
same journal, Vol. i. p. 387.





In the case of the women the like groups give 157.113 and 158.417, or a shrinkage of 1.3 cms.
These values accord with experience from more numerous data, but standing alone cannot be
considered as significantly demonstrating anything. As my purpose is not to demonstrate
anything but only to illustrate the use of methods, I have not sought for long series. The
shrinkage is, of course, less in stature or span than in weight or in the dynamic characters. It is
particularly notable in the case of vital capacity*.

Table II gives the results for stature in a small group of Swedes†.


TABLE II.

Men (stature in cms.)

Age      144-  153-  159-  162-  165-  168-  171-  174-  177-  180-  183-  Totals
Group    152   158   161   164   167   170   173   176   179   182   191

50—80      8     8    11    18    26    26    28    13    12     6     3     154
20—50      4     4    18    29    46    45    38    32    32     8     6     262

Women (stature in cms.)

Age      132-  144-  147-  150-  153-  156-  159-  162-  165-  168-  171-  Totals
Group    143   146   149   152   155   158   161   164   167   170   176

50—80      2     1     8     9    24    21    12    15    12     2     0     106
20—50      2     5     6     5    20    19    18    23    16     9     4     127


We proceed first to find the $\chi^2_{col.}$ and $\chi^2_{min.}$ for men. They are

$\chi^2_{col.} = 8.8532$, leading to P = .540,

$\chi^2_{min.} = 3.5031$, leading to P = .964.

Hence, proceeding by either method, we conclude that no evidence can be drawn from these
small samples of a shrinkage in stature with age.

Turning to the women we find

$\chi'^2_{col.} = 14.6957$, leading to P = .145,

$\chi'^2_{min.} = 8.7271$, leading to P = .559.

We therefore draw the same conclusion for women as for men, but remark that the
probabilities are in both cases for women very much less than for men. These four results again
emphasize the importance of considering what parent population is under consideration; the
most likely parent population giving markedly higher probabilities of no distinction between old
and young in both men and women.

We now consider whether the values of χ²'s reached show any difference in the case of men
and women.

We have, if dashed letters refer to women,

$$\chi'^2_{col.}/\chi^2_{col.} = \frac{14.6957}{8.8532} = 1.6599,$$

$$\chi'^2_{min.}/\chi^2_{min.} = \frac{8.7271}{3.5031} = 2.4912.$$


* See Annals of Eugenics, Vol. ii. p. 105, and Vol. iii. p. 296.

† Biometrika, Vol. iv. pp. 88—89.




The corresponding probabilities are for the ratio test:

$$P(\chi'^2_{col.}/\chi^2_{col.}) = 2I_{.37595}(5,\ 5) = .437,$$

$$P(\chi'^2_{min.}/\chi^2_{min.}) = 2I_{.2864}(5,\ 5) = .166.$$

The χ²'s of the most likely populations give a lesser probability that the causes are accordant
in men and women than the columnar populations, but in the case of both we cannot assert that
women differ from men.

Proceeding now to the difference test we have

$$\tfrac{1}{2}(\chi'^2_{col.} - \chi^2_{col.}) = 2.9212,$$

$$\tfrac{1}{2}(\chi'^2_{min.} - \chi^2_{min.}) = 2.6120,$$

leading respectively to

$$P(\tfrac{1}{2}\chi'^2_{col.} - \tfrac{1}{2}\chi^2_{col.}) = 2(.5 - .334,215) = .332,$$

$$P(\tfrac{1}{2}\chi'^2_{min.} - \tfrac{1}{2}\chi^2_{min.}) = 2(.5 - .308,648) = .383.$$


By this method there is little to choose between the two types of parent populations. From
both we should conclude that we could not assert on the data any difference between men and
women. This result might have been more or less anticipated, as our data were too sparse to
distinguish in either sex between young and old. It will be noted that the ratio test gives in this
case results varying more with the parent population selected than the difference test does, but
no stress can be laid on this.


It is intended to discuss the problem suggested in this note more fully later on the ample
data from Galton's first Anthropometric Laboratory. Meanwhile it seemed worth while to work
out from that material a special case because it illustrates how divergent may be the conclusions
to be drawn from the minimum parent population and the columnar totals population. We
deal with the case of Vital Capacity and Age.


Men (Vital Capacity in cm.³)

Age            50-  140-  155-  170-  185-  200-  215-  230-  245-  260-  275-  290 and  Totals
Group          139  154   169   184   199   214   229   244   259   274   289   over

20—31           50   53   114   244   272   469   456   414   248   218   114     153     2805
31 and over    141  155   159   338   319   447   352   304   133   151    64      59     2622


$$\chi^2_{min.} = 149.4902, \qquad \chi^2_{col.} = 248.2215.$$

By both methods P = .000,000, but there are several more zeros before we come to a
significant figure in $P_{\chi^2_{col.}}$ than in $P_{\chi^2_{min.}}$. Undoubtedly there is a shrinkage in vital capacity
with age in men.

Women (Vital Capacity in cm.³)

Age           27.5-  82.5-  92.5-  102.5-  112.5-  122.5-  132.5-  142.5-  152.5-  162.5-  172.5-  182.5     Totals
Group         82.4   92.4   102.4  112.4   122.4   132.4   142.4   152.4   162.4   172.4   182.4   and over

18—29           22     17     42     49      86     108     126     100      71      59      41      41       762
29 and over     82     46     89     99     130     111     110      81      69      42      25      25       909

$$\chi'^2_{min.} = 71.1734 \text{ and } \chi'^2_{col.} = 92.2205.$$






In both cases P = .000,000, although again the $P_{\chi^2_{min.}}$ will give a somewhat greater
probability than the $P_{\chi^2_{col.}}$. Both demonstrate with overwhelming probability that women's
Vital Capacity shrinks with age. But however overwhelming the probability, it is greater in the
case of men.


We now turn to the fundamental problem: Can this shrinkage in men and women be attributed
to the same set of physical or physiological causes?


We apply first the ratio test. We have

$$\chi'^2_{min.}/\chi^2_{min.} = \frac{149.490217}{71.173415} = 2.1004,$$

$$\chi'^2_{col.}/\chi^2_{col.} = \frac{248.221460}{92.220486} = 2.6916.$$

These lead us to*

$$P(\chi'^2_{min.}/\chi^2_{min.}) = 2I_{.32254}(5.5,\ 5.5) = .2342,$$

$$P(\chi'^2_{col.}/\chi^2_{col.}) = 2I_{.27089}(5.5,\ 5.5) = .1154.$$


While the ratio of the $\chi^2_{min.}$'s gives, as we might expect, a higher degree of probability than
the ratio of the $\chi^2_{col.}$'s, there is nothing in either to suggest that it would not be reasonable to
treat the shrinkage in men and women as due to the same set of causes. A glance, however, at
the curve for n categories on p. 308 of Biometrika, Vol. xxiv, shows us that the point
corresponding to the two χ²'s lies inside the curve, or that the difference test will provide a much more
stringent test than the ratio test.


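We have accordingly†

$$\tfrac{1}{2}(\chi'^2_{min.} - \chi^2_{min.}) = 39.158,396,$$

$$\tfrac{1}{2}(\chi'^2_{col.} - \chi^2_{col.}) = 78.000,485,$$

and

$$P(\tfrac{1}{2}\chi'^2_{min.} - \tfrac{1}{2}\chi^2_{min.}) = 2\left[.5 - S_n(\tfrac{1}{2}\chi'^2_{min.} - \tfrac{1}{2}\chi^2_{min.})\right] = 2\,[.5 - \text{something considerably greater than } .499,993].$$

Thus we have $P(\tfrac{1}{2}(\chi'^2_{min.} - \chi^2_{min.})) < .000,012$.

In the same manner $P(\tfrac{1}{2}(\chi'^2_{col.} - \chi^2_{col.})) < .000,012$,

and considerably less than $P(\tfrac{1}{2}(\chi'^2_{min.} - \chi^2_{min.}))$.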

It cannot be doubted that if Table I for $S_n(x)$ had been carried further, we should have
found both P's less than 1 in 1,000,000 at least. We conclude therefore that with the more
stringent test the shrinkage of vital capacity with age is not due to the same causes in man and
woman, but it is a secondary sexual character very possibly due to differences not only in their
physical environments, but in their physiological life.

This example illustrates the extreme importance of applying in each case the more stringent
test, whether it be the χ²'s ratio or their difference. We may well overlook an important
conclusion, if we do not bear this in mind. In this particular example there is no marked
diversity in result if we use $\chi^2_{min.}$ or $\chi^2_{col.}$. But this will not always be so, and in the case of
shrinkage of brain-weight in women our conclusion will be far less assertive in the case of
$P_{\chi^2_{min.}}$ than in that of $P_{\chi^2_{col.}}$.


* Using Table II, Biometrika, Vol. xxiv. p. 348, with v = n = 12.

† Using Table I, Biometrika, Vol. xxiv. p. 345, where we see $S_n(39.158,396)$ and $S_n(78.000,485)$ lie
well beyond the limits of the table.





(vii) On the Distribution of Student's Ratio for Samples of
Three Drawn from a Rectangular Distribution.

By VICTOR PERLO, M.A., Columbia University.

Let samples be drawn from a continuous distribution of finite range, the chance of a value
lying in a given interval within this range being proportional to the length of the interval. The
distributions of most of the important statistical measures for this type of population are not
known. This paper presents the distribution of t for samples of three, and some comparisons with
Student's distribution.

The statistical measure $t$ is defined, for samples of $n$, as $t = (\bar{x} - m)\sqrt{n}/s$, where $\bar{x}$ is the sample
mean, $m$ is a trial value of the true mean, and $s^2 = \Sigma (x - \bar{x})^2/(n - 1)$. For samples of three, then,
$t = (\bar{x} - m)\sqrt{3}/s$.

If we regard a sample as a point in three dimensional space, we find by methods similar to those
used by H. L. Rietz in determining the distribution of the standard deviation of similar samples*,
that the distribution of $t$ is determined by that of the angle between a cube's diagonal and the
radius vector drawn from the cube's centre to a point within the cube, which reduces to the
problem of computing the volume within the cube of a conical shell with axis the cube's diagonal
and vertex the cube's centre. We get for the distribution ordinate:

-9 / 1 31 \ . 3i«2+2), Jp~- 4 

iWl < 2 -4/ 


4(<+l)(<*-4) 

(i-*n 

2(4-P)s/l-P\ 4-/v 

For 2 the last term of (1) becomes 

3* (P+2) 
(4 - «*)* 


+ ?*-«•+?) tan 
(P - 4)i 

+ : l*-V + t P tanh 
(4-P)* 


-J 3(« + 2) 

/\r* 

V 4_fi 


(® ><>*) (1), 


( 2 ). 


tanh* 


j J 4- 1* 
^3(< + 2) 


to preserve reality. 

The distribution is continuous with continuous derivatives except at $\pm\tfrac{1}{2}$, where the
derivative has points of discontinuity. This function may well be compared with "Student's"
distribution for samples of three drawn from a normally distributed population. Plotting shows
it to be greater than "Student's" at the ends and the middle, and less elsewhere, that is, more
leptokurtic. Also interesting is a comparison with the extremely simple expression for the
probability ordinate for samples of two, as derived by Rider†,

$$\frac{1}{2\,(1+|t|)^2}.$$
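The expression for samples of two can be checked by simulation. The following is a minimal sketch, not part of the original note: it assumes a rectangular population on $(-1, 1)$ with the trial mean $m$ equal to the true mean $0$, and the function names are illustrative only. Integrating the ordinate $1/\{2(1+|t|)^2\}$ over both tails gives $P(|t| > t_0) = 1/(1+t_0)$, which the simulated frequency should approach.

```python
import random


def tail_prob_n2(t0, trials=200_000, seed=1):
    """Monte Carlo estimate of P(|t| > t0) for samples of two
    from a rectangular population on (-1, 1), with m = 0."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        x1 = rng.uniform(-1.0, 1.0)
        x2 = rng.uniform(-1.0, 1.0)
        # For n = 2, s = |x1 - x2| / sqrt(2), so t = (x1 + x2) / |x1 - x2|.
        if abs(x1 + x2) > t0 * abs(x1 - x2):
            hits += 1
    return hits / trials


def tail_prob_n2_exact(t0):
    """Both tails of the ordinate 1 / (2 (1 + |t|)^2) beyond t0."""
    return 1.0 / (1.0 + t0)


if __name__ == "__main__":
    for t0 in (0.5, 1.0, 3.0):
        print(t0, tail_prob_n2(t0), tail_prob_n2_exact(t0))
```

The comparison is made on the tail probability rather than on the ordinate itself, since a tail frequency needs no binning.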

Let $P$ be the probability of a sample having $|t| > |t_0|$, where $t_0$ is some fixed value of the
argument. Geometrically it is twice the volume of a cone of angle $\phi_0$ (the angle corresponding to $t_0$) within the cube
(i.e. the cone about the cube's diagonal with vertex the cube's centre plus its opposite). Integrating
the expressions (1) and (2) over the appropriate intervals, dropping the subscript, and assuming
$t$ positive, we get

— -+ . r~ . (« ><>i) ( 3 ), 


-9 , 3l{ _ 

1- — y tan 

2 (<+l) (P-4) (P - 4)1 

„ JHiJl-P 3i< 

+ 4 -P (4 - P)i 


>/a (t+ 2) 

/i^p 

V 4 ft 


(i>t> 0) 


.(4). 


* H. L. Rietz, Note on the Distribution of the Standard Deviation, etc., Biometrika, Vol. xxiii. 1931,
pp. 424-426.

† Paul R. Rider, On the Distribution of the Ratio of Mean to Standard Deviation, etc., Biometrika,
Vol. xxi. 1929, pp. 124-143, especially pp. 140-141. The formula in Rider's paper is incorrectly stated

as - - -A -j . Corresponding errors appear in the cumulated probability expressions. 

2(l-|t|) 3 





For calculations of the ordinate and $P$, rational approximations to (1) and (3) are obtained by
expanding the inverse tangent in powers of its argument. For (1) we get
8< + !) 4 ~t < s +2 

3(i+l)»(/ + 2)* + 12(<+1) 2 (< + 2)< + 15(< + 2) 6 

_(t»+_2)(<-2) (t>+2)(«-2)» 1 *_ 

63(( + 2)« v l) (2n+i)(i + 2) n+s 3* + s " r 

(3) gives in a similar expansion 

3 (21 + 8) _ ?_+<('“*). +/.iy , 

2(« + l)(< + 2)* 3(* + 2) 3 M5(* + 2) 4 ; 3*- 1 (2w + l)(* + 2)* + * r *‘•• 


For $t > 2$, the first three terms of these series give results correct to at least four places.

Let $p$ be the probability that $t$ exceeds a given value obtained from "Student's" distribution
corresponding to the true $P$ found above. The fiduciary limits most frequently applied in testing for
significant deviations of sample means from true means are .05 and .01. The following table shows
values of $t$ for which $p$ or $P$ take on those values.


      t         p         P

    4.30     .0500     .0774
    5.74     .0292     .0500
    9.90     .0100     .0204
   14.85     .0045     .0100
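The entries of this table can be reproduced approximately. The sketch below is an illustration only and makes two assumptions not stated in the note: that $p$ is the two-tailed probability of "Student's" distribution with two degrees of freedom, for which $p = 1 - t/\sqrt{t^2+2}$, and that the rectangular population may be taken on $(-1, 1)$ with true mean $0$. The true $P$ is estimated by simulation rather than from expressions (3) and (4); the function names are hypothetical.

```python
import math
import random


def student_tail_2df(t0):
    """Two-tailed probability of Student's t with 2 degrees of freedom."""
    return 1.0 - t0 / math.sqrt(t0 * t0 + 2.0)


def rectangular_tail_n3(t0, trials=200_000, seed=7):
    """Monte Carlo estimate of P(|t| > t0) for samples of three
    from a rectangular population on (-1, 1), with m = 0."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        x = [rng.uniform(-1.0, 1.0) for _ in range(3)]
        mean = sum(x) / 3.0
        s2 = sum((v - mean) ** 2 for v in x) / 2.0   # divisor n - 1 = 2
        # |t| > t0 with t = mean * sqrt(3) / s, written without a division.
        if 3.0 * mean * mean > t0 * t0 * s2:
            hits += 1
    return hits / trials


if __name__ == "__main__":
    for t0 in (4.30, 5.74, 9.90, 14.85):
        print(t0, round(student_tail_2df(t0), 4), round(rectangular_tail_n3(t0), 4))
```

With a few hundred thousand trials the simulated $P$ is only good to about three decimal places, which is enough to compare with the tabled values.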


The limit of $P/p$ as $t$ approaches infinity is found by comparing the first terms of the
expansions in powers of $t^{-1}$ of (3) and the expression for $p$, $1 - t/\sqrt{t^2+2}$. The limiting ratio obtained
is $\pi\sqrt{3}/2 = 2.7207$.


(viii) The Distribution of $\sqrt{\beta_1}$ in Samples of Four from a Normal Universe.

By A. T. McKAY, M.Sc.


The estimated value of $\sqrt{\beta_1}$ for samples of 4 is the statistic

$$\sqrt{\beta_1} = \frac{2}{\sqrt{3}}\,\theta, \qquad \theta = \frac{\sqrt{3}\,\sum_{r=1}^{4}(x_r-\bar{x})^3}{\{\sum_{r=1}^{4}(x_r-\bar{x})^2\}^{3/2}} \qquad (1).$$

Now

$$\sum_{r=1}^{4}(x_r-\bar{x})^3 = \{(x_1-\bar{x})^3+(x_2-\bar{x})^3\} + \{(x_3-\bar{x})^3+(x_4-\bar{x})^3\} \qquad (2),$$

and factorising each curled bracket separately we find

$$(x_1+x_2-2\bar{x})A + (x_3+x_4-2\bar{x})B,$$

where $A$ and $B$ represent the remaining factors. But obviously $(x_1+x_2-2\bar{x}) = -(x_3+x_4-2\bar{x})$,
hence this term is a factor of the expression (2). By taking a different grouping of terms in (2)
we may readily show that

$$\sum (x_r-\bar{x})^3 = \tfrac{3}{8}\,(x_1+x_2-x_3-x_4)(x_1+x_3-x_2-x_4)(x_1+x_4-x_2-x_3) \qquad (3).$$

In equation (1) let us now employ the following orthogonal transformation

$$\begin{aligned}
y_0 &= (x_1+x_2+x_3+x_4)/2,\\
y_1 &= (x_1+x_2-x_3-x_4)/2,\\
y_2 &= (x_1+x_3-x_2-x_4)/2,\\
y_3 &= (x_1+x_4-x_2-x_3)/2
\end{aligned} \qquad (4),$$

which yields

$$\theta_1 = \frac{3\sqrt{3}\,y_1y_2y_3}{(y_1^2+y_2^2+y_3^2)^{3/2}} \qquad (5).$$
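The algebra leading to (5) can be verified exactly. The following sketch is an illustration and not part of the original note: it uses rational arithmetic to check that, under the transformation with $y_1 = (x_1+x_2-x_3-x_4)/2$, $y_2 = (x_1+x_3-x_2-x_4)/2$ and $y_3 = (x_1+x_4-x_2-x_3)/2$, the sum of cubed deviations equals $3y_1y_2y_3$ and the sum of squared deviations equals $y_1^2+y_2^2+y_3^2$. The function name is hypothetical.

```python
from fractions import Fraction


def check_identity(x):
    """Return True if both reductions hold exactly for the sample x of four values."""
    x = [Fraction(v) for v in x]
    mean = sum(x) / 4
    y1 = (x[0] + x[1] - x[2] - x[3]) / 2
    y2 = (x[0] + x[2] - x[1] - x[3]) / 2
    y3 = (x[0] + x[3] - x[1] - x[2]) / 2
    cubes = sum((v - mean) ** 3 for v in x)
    squares = sum((v - mean) ** 2 for v in x)
    return cubes == 3 * y1 * y2 * y3 and squares == y1 ** 2 + y2 ** 2 + y3 ** 2


if __name__ == "__main__":
    print(check_identity([1, 0, 0, 0]))
    print(check_identity([3, -7, 11, 2]))
```

Exact fractions are used so that the check is an identity test and not a floating-point comparison.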





Hence in virtue of the well-known orthogonal property of the normal function, we see that our
problem reduces to seeking the distribution of the statistic $\theta_1$ in samples of three from a normal
universe. By partial differentiation of (5) we find that there is an absolute maximum of $\theta_1$ when
$y_1 = y_2 = y_3$, thus the distribution of $\theta_1$, and therefore of $\theta$, has termini at $\pm 1$. This means that
the distribution of $\sqrt{\beta_1}$ terminates at $\pm 2/\sqrt{3}$. Since the distribution is symmetrical about zero
we have only even moments, which are given by

(e) - 

Changing to polar coordinates we derive


* j 3 h f%ir fir f** 

H 2n ($i) — j J j ( cos ^ s * n55 ^ cos 4* 8X11 4>^ n sxn & drd8d<j> (7). 

The integrand of (7) is separable, and the use of standard formulae readily yields


Pan (0l) = c 


3» \ 2 / 


2, r (6n+^ 


.( 8 ). 


Hence $\mu_2(\theta_1) = 9/35$ and $\mu_4(\theta_1) = 729/5005$, and thus making allowance for the factor of $2/\sqrt{3}$ in
equation (1), $\mu_2 = 12/35$ and $\mu_4 = 1296/5005$, giving $\beta_2 = 315/143$. These values are confirmed from
an expression of R. A. Fisher's*.
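These moments can also be checked by a small sampling experiment. The sketch below is an illustration only: it assumes the usual moment definition of the statistic, $m_3/m_2^{3/2}$ with divisor $n$ in both moments, and the target values $12/35$ for the second moment and $315/143$ for $\beta_2$; the function names are hypothetical.

```python
import random


def sqrt_b1(x):
    """Sample value of the skewness statistic m3 / m2^(3/2), divisors n."""
    n = len(x)
    mean = sum(x) / n
    m2 = sum((v - mean) ** 2 for v in x) / n
    m3 = sum((v - mean) ** 3 for v in x) / n
    return m3 / m2 ** 1.5


def simulate_moments(trials=100_000, seed=3):
    """Estimate mu_2 and beta_2 of the statistic in normal samples of four."""
    rng = random.Random(seed)
    s2 = 0.0
    s4 = 0.0
    for _ in range(trials):
        g = sqrt_b1([rng.gauss(0.0, 1.0) for _ in range(4)])
        s2 += g * g
        s4 += g ** 4
    mu2 = s2 / trials
    mu4 = s4 / trials
    return mu2, mu4 / mu2 ** 2


if __name__ == "__main__":
    mu2, beta2 = simulate_moments()
    print(mu2, 12 / 35)
    print(beta2, 315 / 143)
```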


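Let us now proceed to find the distribution of $\theta_1$. From equation (5), we see that it is
necessary to integrate

$$\frac{1}{(2\pi)^{3/2}}\,e^{-\frac{1}{2}(y_1^2+y_2^2+y_3^2)}\,dy_1\,dy_2\,dy_3 \qquad (9)$$

over the field of integration for which

$$w < \theta_1 < w + \delta w \qquad (10),$$

or transforming to polar coordinates, we require to integrate

$$\frac{1}{(2\pi)^{3/2}}\,r^2 e^{-r^2/2}\,dr\,\sin\theta\,d\theta\,d\phi \qquad (11)$$

over the field for which

$$w < \frac{3\sqrt{3}}{2}\,\sin^2\theta\,\cos\theta\,\sin 2\phi < w + \delta w \qquad (12),$$

where $0 \le \theta \le \pi$ and $0 \le \phi \le 2\pi$.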

Now since (12) is independent of $r$, the latter variable can be integrated out in (11), hence the
subject of integration is thus

$$\frac{1}{4\pi}\,\sin\theta\,d\theta\,d\phi \qquad (13).$$

Regarding $w$ as positive, which is merely equivalent to treating the positive half of the
distribution, and writing $\cos\theta = x$, we have to integrate


over the field for which 


^ dxdfj), 


at 


? < *jr- A* ( 1 — #*) sin 0 < IT + bw 
2 ■* 


.(14) 

,(16), 


* R. A. Fisher, Proc. Roy. Soc. Ser. A, Vol. 130, No. A 812, or E. S. Pearson, Biometrika, Vol. xxii.
Parts iii. and iv. 1931 (Miscellanea).




where now $0 \le x \le 1$ and $0 \le \phi \le \pi/2$. Performing the integration with respect to $\phi$, we see that
we have to effect the integration

\ jfa dx (16) 

over the field Oigx^l conditioned by 

2w-3*.j: (I-**) sin <£>0 (17). 

The equation (16) thus becomes

\ -**))<& (18), 

where the limits for $x$ are determined from the fact that $x$ runs from 0 to 1, subject to the
condition that

0^2ir/3t.v(l-^) <$1 (19). 

Designating the distribution function sought by $f(w)$, i.e. the expression (18), we have then

J Xi (2<>) ’ 

where $x_1$ and $x_2$ are to be determined from (19).
By changing the variable in (20) we have

/(»’)■=- f f ’ — — (21), 

* J t, V< (27< (1 - tf—Avr 1 ) 

where $t_1$ and $t_2$ are to be found from

0$4^/27*(l-2) 2 ^l} ^ 

o^i^i J V h 

Thus, from (21), we conclude that $f(w)$ is a complete elliptic integral, for the limits are singularities
of the integrand*.

In equation (22) we observe that the element between the two inequality signs has a minimum
value at $t = \tfrac{1}{3}$, hence for our limits of the integral we require the two roots of the cubic

$$27t(1-t)^2 - 4w^2 = 0 \qquad (23)$$

which lie nearest to, and on each side of, the value $t = \tfrac{1}{3}$. This will be readily seen from a rough
graph. Solving the cubic (23) by the usual method, the appropriate roots are found to be

$$\begin{aligned}
t_1 &= \tfrac{4}{3}\cos^2(\tfrac{1}{3}\cos^{-1}w + \pi/3), & 0 < t_1 < \tfrac{1}{3},\\
t_2 &= \tfrac{4}{3}\cos^2(\tfrac{1}{3}\cos^{-1}w + 2\pi/3), & \tfrac{1}{3} < t_2 < 1
\end{aligned} \qquad (24).$$

In (21) let us now make the substitution to a new variable $y$ defined by

$$t = \tfrac{4}{3}\cos^2(\tfrac{1}{3}\cos^{-1}y + 2\pi/3),$$

tl-n /(»)- 2 (25), 

7r3i J -w V(l“ y*)(y*— w 8 ) 

or putting $y = \sqrt{1 - x^2}$ we have

/(*)- \( +a ^k ^£±tl*)da, (26), 

tt3» J -o y'(l-* s )(0*-* 2 ) 1 h 

where $Q^2 = 1 - w^2$. Expanding the sine term and noting the disappearance of the "odd" part we
have

$$f(w) = \frac{2}{3\pi}\int_0^{Q}\frac{\cos(\tfrac{1}{3}\sin^{-1}x)\,dx}{\sqrt{(1-x^2)(Q^2-x^2)}} \qquad (27).$$

* Whittaker and Watson, Modern Analysis (1920), see § 22.32 et seq. We might infer from this that
$f(w)$ will prove to be a hypergeometric function.





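Writing in this $x = \sin\theta$ and $Q = \sin\alpha$,

$$f(w) = \frac{2}{3\pi}\int_0^{\alpha}\frac{\cos(\theta/3)\,d\theta}{\sqrt{\sin^2\alpha - \sin^2\theta}} \qquad (28)$$

$$= \frac{2}{3\pi}\int_0^{2\alpha}\frac{\cos(\phi/6)\,d\phi}{\sqrt{2(\cos\phi - \cos 2\alpha)}} \qquad (29).$$

But the latter is Mehler's Integral* for the Legendre Function, hence

$$f(w) = \tfrac{1}{3}\,P_{-1/3}(\cos 2\alpha) \qquad (30)$$

$$= \tfrac{1}{3}\,F(\tfrac{1}{3}, \tfrac{2}{3}; 1; 1-w^2) \qquad (31),$$

where $F(a, b; c; x)$ is the usual hypergeometric function notation.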

Just as a check on our analysis we may now seek the moments of the distribution by
proceeding from (31),

$$\mu_{2n}(\theta_1) = \tfrac{2}{3}\int_0^1 w^{2n}\,F(\tfrac{1}{3}, \tfrac{2}{3}; 1; 1-w^2)\,dw \qquad (32)$$

$$= \tfrac{2}{3}\int_0^{\pi/2}\sin^{2n}\theta\,\cos\theta\,F(\tfrac{1}{3}, \tfrac{2}{3}; 1; \cos^2\theta)\,d\theta \qquad (33).$$

Expanding the hypergeometric function and integrating term by term we find



Whence by employing the Gamma Function Triplication formula† with argument $(n + \tfrac{1}{2})$ we
derive the result stated in (8).
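The term-by-term integration described here can be carried out numerically. The sketch below is an illustration under stated assumptions, not the author's own working: it takes the frequency function of $\theta_1$ to be $\tfrac{1}{3}F(\tfrac13,\tfrac23;1;1-w^2)$ on $(-1, 1)$, so that each term of the series integrates to a Beta function, $\mu_{2n}(\theta_1) = \tfrac{1}{3}\sum_k c_k\,B(n+\tfrac12, k+1)$ with $c_k = (\tfrac13)_k(\tfrac23)_k/(k!)^2$, and it compares the sums with the values $9/35$ and $729/5005$. The function name and the number of terms are arbitrary choices.

```python
def theta_moment(n, terms=20_000):
    """mu_{2n}(theta_1) = (1/3) * sum_k c_k * B(n + 1/2, k + 1), summed term by term."""
    c = 1.0                    # c_0, coefficient of (1 - w^2)^0 in F(1/3, 2/3; 1; .)
    beta = 1.0 / (n + 0.5)     # B(n + 1/2, 1)
    total = 0.0
    for k in range(terms):
        total += c * beta
        # Recurrences: c_{k+1} / c_k and B(n + 1/2, k + 2) / B(n + 1/2, k + 1).
        c *= (k + 1.0 / 3.0) * (k + 2.0 / 3.0) / (k + 1.0) ** 2
        beta *= (k + 1.0) / (k + n + 1.5)
    return total / 3.0


if __name__ == "__main__":
    print(theta_moment(1), 9 / 35)
    print(theta_moment(2), 729 / 5005)
    # Rescaling by (2 / sqrt(3))^2 = 4/3 gives the second moment 12/35.
    print(4.0 / 3.0 * theta_moment(1), 12 / 35)
```

The series for the total area ($n = 0$) converges much more slowly than those for the moments, so the area is better checked analytically.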

From equation (31) we conclude that the distribution of $\sqrt{\beta_1}$ in samples of 4 from a normal
universe is

$$\phi(x) = \frac{1}{2\sqrt{3}}\,F(\tfrac{1}{3}, \tfrac{2}{3}; 1; 1 - \tfrac{3}{4}x^2) \qquad (36).$$

Now it is known that when $a + b - c = 0$, the hypergeometric function $F(a, b; c; t)$ is convergent
when $|t| < 1$ and divergent when $t = 1$, whence we conclude that the distribution $\phi(x)$ has the
following properties:

(i) Symmetrical.

(ii) A cusp at infinity when $x = 0$.

(iii) A finite ordinate of $1/2\sqrt{3} = 0.288675$ at the termini $x = \pm 2/\sqrt{3} = \pm 1.1547$.

(iv) 

The graph of the curve is shown in the figure, together with the Pearson Type II curve, which
results by using the second and fourth moments of the true distribution. The calculation of the
latter is as follows:


* Whittaker and Watson, loc. cit. §§ 15.231 and 15.22.
† Whittaker and Watson, loc. cit. § 12.15.




whence using Elderton's notation

$$m = 1.263158, \quad a = 6/\sqrt{19} = 1.376494, \quad y_0 = 0.58372, \quad a^2 = 1.894737,$$

$$y = 0.58372\,(1 - x^2/1.894737)^{1.263158} \qquad (37),$$

with termini at $x = \pm 1.376494$.
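The constants of this curve follow from the second moment and $\beta_2$ alone. As a hedged illustration, the sketch below assumes the usual Type II relations $m = (5\beta_2 - 9)/\{2(3-\beta_2)\}$ and $a^2 = 2\mu_2\beta_2/(3-\beta_2)$, with $y_0$ fixed by unit area, and applies them to $\mu_2 = 12/35$ and $\beta_2 = 315/143$; the function name is hypothetical.

```python
import math


def type_ii_constants(mu2, beta2):
    """Constants of the symmetrical curve y = y0 (1 - x^2/a^2)^m with unit area."""
    m = (5.0 * beta2 - 9.0) / (2.0 * (3.0 - beta2))
    a = math.sqrt(2.0 * mu2 * beta2 / (3.0 - beta2))
    # Unit area: y0 * a * B(1/2, m + 1) = 1.
    y0 = math.gamma(m + 1.5) / (a * math.sqrt(math.pi) * math.gamma(m + 1.0))
    return m, a, y0


if __name__ == "__main__":
    m, a, y0 = type_ii_constants(12 / 35, 315 / 143)
    print(m, a, a * a, y0)
```

The exponent comes out as exactly $24/19$ and the terminus as $6/\sqrt{19}$.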

Approximation to the Curve. 

By use of Stirling's approximation in the general term of the hypergeometric series and
comparing the approximation derived therefrom with the true coefficients obtained by direct
calculation, it may be shown that

(*)-«»(*) ( 39 ), 

where a = 0*275663, ^ (z) ■* 1 — a log (1 - 2 ), 

<x> /I _ e -2/0»\ 

«, ( 2 ) = (0*001488^ + 0*0001 20z 2 + ...), f 2 (*) *2 2 * v . 

i n 

Now for 0 ^ 2 ^ 1 , z n and (1 - e-*l 0n )/n are always positive and decrease steadily as n increases, 


hence by Cauchy's Integral test, 

«»(*)<* (l - « “ 2/ ") +!"**( I - e - 2 /“) y (39). 

By use of the mean value theorem and a change of variable we find

(2 (2)<0-19926i + 2 -e’Vj (40). 

The latter integral may be evaluated from the British Association Tables (Vol. I. Table VII) 
with the result 

<2 W <0*40972 (41), 

hence af 2 ( 2 ) - e 1 ( 2 ) <0*1 11 52 (42). 

Thus the percentage error in taking $\psi(z)$ as an approximation is numerically less than or
equal to

1 1*152/(1 -a log (1- 2 )) (43), 

so that limit {>//■ ( z ) - $ ; 1, z)} s (44). 

z-W) or l 


The expression (43) can be shown to have a maximum value of 6.22% at $z = 0.8443$, so we may
conclude that $\psi(z)$ errs in excess by less than 6.22% of its own value. Thus with a percentage
error of at most 6.22% the distribution function $\phi(x)$ of equation (36) is given by

$$\phi(x) \approx 0.311568 - 0.366466\,\log_{10}|x| \qquad (45).$$

The latter is, of course, considerably more accurate when $x$ is very small or very near the limit
of the range.
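The quality of the logarithmic approximation can be examined directly. The sketch below is an illustration only: it assumes the exact frequency function is $\phi(x) = \frac{1}{2\sqrt3}F(\tfrac13,\tfrac23;1;1-\tfrac34x^2)$, sums the hypergeometric series term by term, and compares it with $0.311568 - 0.366466\log_{10}|x|$. The function names and the number of terms are arbitrary, and the series is slow for very small $|x|$.

```python
import math


def phi_exact(x, terms=5000):
    """Series value of F(1/3, 2/3; 1; z) / (2 sqrt 3) with z = 1 - 3 x^2 / 4."""
    z = 1.0 - 0.75 * x * x
    c = 1.0        # coefficient of z^k
    power = 1.0    # z^k
    total = 0.0
    for k in range(terms):
        total += c * power
        c *= (k + 1.0 / 3.0) * (k + 2.0 / 3.0) / (k + 1.0) ** 2
        power *= z
    return total / (2.0 * math.sqrt(3.0))


def phi_log(x):
    """The logarithmic approximation to the frequency function."""
    return 0.311568 - 0.366466 * math.log10(abs(x))


if __name__ == "__main__":
    for x in (0.2, 0.5, 0.8, 1.0, 1.15):
        exact, approx = phi_exact(x), phi_log(x)
        print(x, exact, approx, 100.0 * (approx - exact) / approx)
```

The last column printed is the excess of the approximation as a percentage of its own value, the quantity bounded in the text.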

By integrating equation (45) over the entire range of the distribution we obtain the value
1.087 instead of unity. Thus the total error of the entire area is 8.7%, so that when integrating
over a subrange this error can be simply apportioned.

The Rectangular Universe. 

Whenever the distribution of a statistic in samples of $n$ is determined, it is always of
considerable interest to enquire to what extent the form of the derived distribution is dependent
on that of the parent. In our case, for example, we shall consider the distribution of $\sqrt{\beta_1}$ in
samples of 4 from a rectangular universe. We proceed by means of a sampling experiment, using
the first 200 samples of 4 given by Shewhart*. Since the means and standard deviations are also

* W. A. Shewhart, Economic Control of Quality of Manufactured Product (Macmillan, 1931),
Appendix II. Table B.




recorded in his Table E, the necessary calculation was not so difficult. The final frequency 
distribution is shown in the table below. 


Range for              Frequency        Theor. Freq.
$|\sqrt{\beta_1}|$     (Rectangular)    (Normal)         $\chi^2$

 .0 -  .1                  44             31.40           5.05
 .1 -  .2                  22             24.28            .21
 .2 -  .3                  22             20.77            .07
 .3 -  .4                  18             18.57            .02
 .4 -  .5                  17             16.92            .00
 .5 -  .6                  15             15.64            .02
 .6 -  .7                  10             14.65           1.47
 .7 -  .8                  13             13.88            .06
 .8 -  .9                  11             13.34            .41
 .9 - 1.0                   9             12.61           1.03
1.0 - 1.1                  14             12.16            .28
1.1 - 1.16                  5              5.78            .11

Totals                    200            200.00           8.73


Column 3 gives approximate values of the theoretical frequencies found by reading from the
Curve I in the figure and using Simpson's Rule. This is sufficiently accurate for our purpose.
With $\chi^2 = 8.73$ and 11 degrees of freedom we find that $P(>\chi^2) > .65$, thus the fitting is a very
plausible one.
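The $\chi^2$ column can be recomputed from the two frequency columns. The short sketch below is an illustration: it takes each contribution as $(\text{observed} - \text{theoretical})^2/\text{theoretical}$, using the frequencies of the table above; the variable and function names are hypothetical.

```python
observed = [44, 22, 22, 18, 17, 15, 10, 13, 11, 9, 14, 5]
theoretical = [31.40, 24.28, 20.77, 18.57, 16.92, 15.64,
               14.65, 13.88, 13.34, 12.61, 12.16, 5.78]


def chi_square_terms(obs, theo):
    """Contribution of each group to chi-square."""
    return [(o - t) ** 2 / t for o, t in zip(obs, theo)]


if __name__ == "__main__":
    terms = chi_square_terms(observed, theoretical)
    for o, t, c in zip(observed, theoretical, terms):
        print(o, t, round(c, 2))
    print("total", round(sum(terms), 2))
```

The unrounded total differs slightly from the sum of the rounded entries in the table.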


[Figure: The Distribution of $\sqrt{\beta_1}$ in Samples of Four from a Normal Universe.]






Summary and Conclusions.

1. The distribution of $\sqrt{\beta_1}$ in samples of four from a normal universe is determined and proves
to be a symmetrical curve having finite ordinates at the termini and an infinitely distant cusp on
the axis of symmetry.

2. This new, inverted T, type distribution is interesting in that it shows, among other things,
that the normal universe can give rise to a derived distribution which cannot be approximated
to by a Pearson Type Curve. It provides a warning, therefore, against approximating to theoretical
distributions by the mere use of moments without first assuring, by means of sampling experiments
or otherwise, that the approximation curve selected has the same general character as the
true curve.

3. The results of a sampling experiment suggest that for a rectangular universe the distribution
of $\sqrt{\beta_1}$ in samples of four could most reasonably be the same as that for the normal universe.


(ix) Note on Mr McKay's paper.


I should not expect that a single Pearson curve would describe satisfactorily the distribution
of any statistical coefficient based upon a sample of four. One has had enough experience in the
distribution of the product-moment coefficient* and the correlation coefficient† in very small
samples from a normal population to realise the truth of this. On the other hand I should be
surprised if the Pearson curves would not give a reasonable approximation as the samples
increased from 15 to 25‡.

Even in Mr McKay's case the divergence is not so excessive as his diagram would suggest.
We may look upon Mr McKay's curve not as a single curve, but as a curve and its mirrored
image, in precisely the same manner as I have treated the distribution of the "centre of the
range" in samples of size $n$ drawn from a rectangular parent population§. In that case we have
a cusp at a finite distance from the origin on the axis of symmetry, and mirror curves, each of
which is a Pearson curve of Type IX. Those mirror curves are in that case the accurate solution,
not an approximation.

Accordingly in the present case of the distribution of $\sqrt{\beta_1}$ in samples of four, it seems reasonable
to use mirror curves as Mr McKay does in his accurate solution, and fit each of them to a Pearson
curve by using the second or fourth moment coefficients about the asymptotic axis. The combined
mirror curves will then have the same first four moment coefficients about the axis of symmetry.
We are not provided with the third moment coefficient of the half McKay curve so that one
must be content with the second or fourth. The appropriate Pearson curve is Type VIII,
i.e. $y = y_0\,(a/x)^m$, where $x$ ranges from 0 to $a$.


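We have for moment coefficients about $x = 0$‖:

$$\mu_2' = a^2\,\frac{1-m}{3-m}, \qquad \mu_4' = a^4\,\frac{1-m}{5-m},$$

and for unit area $y_0 = (1-m)/2a$¶.

Accordingly

$$\beta_2 = \frac{\mu_4'}{\mu_2'^{\,2}} = \frac{(3-m)^2}{(1-m)(5-m)} = \frac{n^2}{n^2-4}, \quad \text{if } n = 3 - m.$$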


* Biometrika, Vol. xxi. pp. 170-180.

† Biometrika, Vol. xi. pp. 887-888.

‡ See the last two memoirs just cited.

§ Biometrika, Vol. xxiii. pp. 894-898.

‖ Phil. Trans. Vol. 218 A, p. 488.

¶ The factor $\tfrac{1}{2}$ is introduced, because the area is for half the mirror curves combined.





But $\beta_2 = \beta_2$ of McKay's curve $= 315/143$, hence

$$n^2/(n^2-4) = 315/143,$$

or $n^2 = 315/43$ and $n = 2.7065,8113$.

Thus $m = .2934,1887$.

Further, the standard deviation of McKay's curve is $\sqrt{12/35}$ and accordingly we find

$$a^2 = \frac{12}{35}\times\frac{2.7065,8113}{.7065,8113} = 1.3133,2501,$$

or $a = 1.1460,0393$.
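The arithmetic of this fit is easily repeated. The sketch below is an illustration under the relations used in this note, namely $\beta_2 = n^2/(n^2-4)$ with $n = 3-m$, $\mu_2' = a^2(1-m)/(3-m)$ and $y_0 = (1-m)/2a$ for the half curve; the function name is hypothetical.

```python
import math


def type_viii_from_moments(mu2, beta2):
    """Constants of the mirror curve y = y0 (a/x)^m fitted from mu2 and beta2."""
    n = math.sqrt(4.0 * beta2 / (beta2 - 1.0))   # solves n^2 / (n^2 - 4) = beta2
    m = 3.0 - n
    a = math.sqrt(mu2 * n / (n - 2.0))           # mu2 = a^2 (1 - m) / (3 - m)
    y0 = (1.0 - m) / (2.0 * a)                   # each mirror curve has area 1/2
    return n, m, a, y0


if __name__ == "__main__":
    print(type_viii_from_moments(12 / 35, 315 / 143))
```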

We have here the chief characteristics of the accurate McKay curve in approximative values. 

Lastly $y_0 = (1-m)/2a = .308,280$.

Thus the fit by the first four moments consists in the mirror curves of Type VIII form:

$$y = .308,280\,\left(\frac{1.146,004}{x}\right)^{.293,419}.$$




We can, however, get a still better result by an appeal à priori to a principle which determines
the range of $\sqrt{\beta_1}$ in samples from any parent population.

Consider a parent population with a range $b$ and let samples of size $n$ be drawn from it. Now
consider the following scheme: suppose $n - 2$ of the sample values are taken as at $x = 0$, the
end of the range, one value at $x = c$ ($c < b$) and a third at $x = b$. Then we have

$$\sqrt{\beta_1} = \frac{1}{n}\sum_{s=1}^{n}(x_s-\bar{x})^3 \Big/ \Big\{\frac{1}{n}\sum_{s=1}^{n}(x_s-\bar{x})^2\Big\}^{3/2}.$$

Now $\bar{x} = (c+b)/n$ and accordingly we have

$$\sqrt{\beta_1} = \sqrt{n}\,\frac{\{c(n-1)-b\}^3 + \{b(n-1)-c\}^3 - (c+b)^3(n-2)}{[\{c(n-1)-b\}^2 + \{b(n-1)-c\}^2 + (c+b)^2(n-2)]^{3/2}}$$

$$= \frac{(n-2)\{(c^3+b^3)(n-1) - 3cb(c+b)\}}{\{(c^2+b^2)(n-1) - 2cb\}^{3/2}} = \frac{(n-2)(n-1-3n\lambda)}{(n-1-2n\lambda)^{3/2}}, \quad \text{if } \lambda = \frac{bc}{(b+c)^2}.$$

When $c = 0$, or we have one value at one end of the range and the rest at the other,

$$\sqrt{\beta_1} = \frac{n-2}{\sqrt{n-1}}.$$

This is the maximum value of $\sqrt{\beta_1}$. For if we move the value at one end closer to the $n - 1$
values at the other, we merely shorten the range $b$, but get the same value of $\sqrt{\beta_1}$, which is
independent of the range. If we put two at one end of the range and $n - 2$ at the other, this is
putting $c = b$, or $\lambda = \tfrac{1}{4}$, we have

$$\sqrt{\beta_1} = \frac{n-4}{\sqrt{2(n-2)}}, \quad \text{which is less than } \frac{n-2}{\sqrt{n-1}}.$$

Finally, if we start with one value at one end of the range and $n - 1$ at the other, and
move one of the latter out a distance $c$, then we have as above

$$\sqrt{\beta_1} = \frac{(n-2)(n-1-3n\lambda)}{(n-1-2n\lambda)^{3/2}},$$

but if we make this a maximum with $\lambda$ we find

$$n - 1 - 2n\lambda = n - 1 - 3n\lambda,$$

or $\lambda = 0$, that is $c = 0$, or to move a value from the end of the range reduces $\sqrt{\beta_1}$.





But the arrangement 1 and $n - 1$ values at the ends of the range is independent of (i) the size
of the range, which may be increased to infinity, or of (ii) the nature of the parent population.

Thus under all circumstances the value of $\sqrt{\beta_1}$ must lie in the range $\pm\dfrac{n-2}{\sqrt{n-1}}$.

To this extent the parent population is indifferent. If $n = 4$, then we have for the range
$\pm 2/\sqrt{3} = \pm 1.154,7005$, agreeing with Mr McKay's value.
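This limit is easily tried on particular samples. The sketch below is an illustration, not a proof: it evaluates the statistic as $m_3/m_2^{3/2}$ (divisor $n$ in both moments), confirms that one value at one end and $n - 1$ at the other attains $(n-2)/\sqrt{n-1}$, that two values at one end give $(n-4)/\sqrt{2(n-2)}$, and that random samples from a rectangular population stay within the limit. The function names are hypothetical.

```python
import math
import random


def sqrt_b1(x):
    """Sample value of m3 / m2^(3/2), with divisor n in both moments."""
    n = len(x)
    mean = sum(x) / n
    m2 = sum((v - mean) ** 2 for v in x) / n
    m3 = sum((v - mean) ** 3 for v in x) / n
    return m3 / m2 ** 1.5


def bound(n):
    """Greatest possible value of the statistic in a sample of n."""
    return (n - 2) / math.sqrt(n - 1)


def largest_in_random_samples(n, trials=2000, seed=11):
    """Largest absolute value found in random rectangular samples of size n."""
    rng = random.Random(seed)
    return max(abs(sqrt_b1([rng.random() for _ in range(n)])) for _ in range(trials))


if __name__ == "__main__":
    for n in (4, 6, 9):
        extreme = [0.0] * (n - 1) + [1.0]
        print(n, sqrt_b1(extreme), bound(n), largest_in_random_samples(n))
```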

Knowing our range we can make use of either the second or fourth moment coefficient, as we
have not a knowledge of the odd moments to fit our curve, or we might fit one of Mr Hansmann's
curves* which use $\mu_2$, $\beta_2$ and $\beta_4$. As my sole purpose is to indicate that a well-chosen Pearson
curve can approximate to Mr McKay's curve, I will take a Type VIII curve and knowing the
range fit from one even moment coefficient. While I should prefer the second moment coefficient,
had the range been infinite, I prefer the fourth to the second when the range is limited. There
is, however, very little difference between the values of $y_0$ and $m$ found from the second and
fourth moment coefficients, and both give results very close to the values found as above, when
the range is supposed unknown. We have for the constants of the Type VIII curve:



                                              Range         $y_0$          $m$

Range unknown, using $\mu_2'$ and $\mu_4'$    1.146,004     .308,280     .293,419
Range known, using $\mu_2'$                   1.154,701     .299,778     .307,692
Range known, using $\mu_4'$                   1.154,701     .295,291     .318,054


On the scale of Mr McKay's diagram there is scarcely anything to show between these curves.
In the diagram below the curve as found from the known range and $\mu_4'$ is figured.
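The two "range known" fits can be recomputed from a single moment each. The sketch below is an illustration resting on the relations of this note, $\mu_2' = a^2(1-m)/(3-m)$, $\mu_4' = a^4(1-m)/(5-m)$ and $y_0 = (1-m)/2a$, with $a = 2/\sqrt{3}$, $\mu_2' = 12/35$ and $\mu_4' = \beta_2\mu_2'^2$ where $\beta_2 = 315/143$; the function names are hypothetical.

```python
import math

A = 2.0 / math.sqrt(3.0)                  # known range of the statistic for n = 4
MU2 = 12.0 / 35.0
MU4 = (315.0 / 143.0) * MU2 ** 2


def m_from_second(mu2, a):
    """Solve mu2 = a^2 (1 - m) / (3 - m) for m."""
    r = mu2 / a ** 2
    return (1.0 - 3.0 * r) / (1.0 - r)


def m_from_fourth(mu4, a):
    """Solve mu4 = a^4 (1 - m) / (5 - m) for m."""
    r = mu4 / a ** 4
    return (1.0 - 5.0 * r) / (1.0 - r)


def y0(m, a):
    """Terminal ordinate when each mirror curve has area 1/2."""
    return (1.0 - m) / (2.0 * a)


if __name__ == "__main__":
    m2 = m_from_second(MU2, A)
    m4 = m_from_fourth(MU4, A)
    print(A, y0(m2, A), m2)
    print(A, y0(m4, A), m4)
    # Second moment implied by the fourth-moment fit.
    print(A ** 2 * (1.0 - m4) / (3.0 - m4))
```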

Thus the fit is by mirror curves of the Type VIII form:

$$y = .295,291\,\left(\frac{1.154,701}{x}\right)^{.318,054}.$$

We have here the chief characteristics of the accurate McKay curve in approximative values,
namely:

(i) Symmetrical.

(ii) A cusp at infinity when $x = 0$.

(iii) A finite ordinate $= 0.295,291$ (instead of $0.288,675$), at the termini $x = \pm 1.154,701$.

(iv) $\sigma^2 = .339,030$ instead of $.342,857$, and $\mu_4$ the same as the true value.

(v) $\log_{10}\phi(x) = \bar{1}.808,1928 - .318,054\,(1 + \log_{10}|x|)$.

An examination of the diagram shows that the mirror curves of Type VIII, while they
provide no very accurate fit, are substantially more satisfactory than the attempt to fit a continuous
curve of Type II to what is actually a mirror curve. Had we worked from the three
moment coefficients $\mu_2'$, $\mu_3'$ and $\mu_4'$ of the McKay mirror curve about its origin we might have
hoped for a still better fit, and we have at any rate encouragement for the suggestion that with
larger, if not very large samples, the fitting of the mirrored curve, not the pair of mirrored curves,
by a Pearson curve will give a practically reasonable graduation of $\sqrt{\beta_1}$†.

* Thesis for the London doctorate not yet published. Mr Hansmann fits higher order symmetrical
curves by taking $\mu_2$, $\beta_2$ and $\beta_4$ of the observations to agree with the curve values.

† J. Pepper has shown, indeed, that with samples of 10 only, he got a very good fit to the $\sqrt{\beta_1}$
distribution even without the use of mirrored curves of finite range.







[Figure: Comparison of Pearson Curve of Type VIII and its Mirror Curve with McKay's accurate Curve.]



Mr McKay’s discussion of the Rectangular Parent Population is of much interest and value 
as showing that with very small samples the actual distribution of the parent population is of 
small importance, but at the same time it contains a warning to those who propose from small 
samples to deduce anything concerning the characteristics of the parent population. As I have 
endeavoured to indicate in a recent paper*, it may well need samples of upwards of 100 to safely 
infer whether the parent population is normal or rectangular. Thus the chief value of Mr McKay’s 
comparison of samples from a rectangular parent population with the theoretical results from a 
normal parent population does not seem to me to lie in the fact that the latter will suffice to 
describe the former— -the theoretical results of sampling from a rectangular population would no 
doubt equally well describe very small samples from a normal population — no, the chief value 
lies in the warning it gives that, notwithstanding we have in a small sampling found agreement
with theoretical predictions as to small sampling from a type of parent population A, this
provides no real evidence that the actual parent population was not of a wholly different type B.

K. P. 


(x) Note on the Fitting of Frequency Curves.

One of the chief difficulties which beset the path of the inventor of a system of frequency
curves is the too ready manner in which others may apply, or rather misapply them and so bring
discredit on a system, the rules of which they have not followed, or more often misunderstood.


See Biometrika, Vol. xxrr. p. 871. 





I could cite many instances of this in the case of my own system of curves*, but a noteworthy
illustration of it occurs in a dissertation for the Ph.D. of the University of Michigan by
Mr Pae-Tsi Yuan†. It is entitled "On the Logarithmic Frequency Distribution and the Semi-logarithmic
Correlation Surface." I am not concerned here with the question of whether Mr Yuan
has contributed anything novel to the subject, which has been worn fairly threadbare by numerous
previous writers. I deal only with the two points in which he refers to my own contributions to
the topic. In a paper of 1895‡ I defined the "skewness" of a frequency distribution to be the
ratio of the distance between the mean and the mode to the standard deviation of the distribution,
and in 1905§ I showed that the logarithmic curve could not be of wide use, because the range of
"skewness" it provides is limited, while in actual practice "skewness" can take any value
whatever. Mr Yuan‖ remarks that while the skewness of the logarithmic curve with my
definition is limited, this only indicates that my definition of skewness does not give a satisfactory
measure of skewness, and advocates $\alpha_3$, which is the symbol he prefers to use instead of $\sqrt{\beta_1}$.
Now $\sqrt{\beta_1}$ may if any one prefers be used as a measure of asymmetry in frequency distributions,
but that has nothing to do with my definition of "skewness." Whether you call (mean − mode)
divided by standard deviation "skewness" or not, the fact remains that the quantity in question
is a physical character of frequency distributions, and is limited in the logarithmic curve and is
not limited in frequency distributions in general. $\sqrt{\beta_1}$ is not limited in the logarithmic curve, but
for every value of $\beta_1$ there is only one available value of $\beta_2$ and of the other higher $\beta$'s. The curve
connecting $\beta_1$ and $\beta_2$ has been plotted by Pretorius¶ and his graph is reproduced on p. 215.
Unless the $\beta_1$ and $\beta_2$ of a distribution give a point lying close to the broken line (L) of this
diagram we cannot get a good representation of the frequency. If the point does lie near that
line, I will guarantee as good a fit with a Type VI curve to any actual observational series.

If the point ($\beta_1$, $\beta_2$) lies some distance from the (L) line, its fourth moment coefficient must
be discordant with that provided by the logarithmic curve, and the graduation will fail to be as
good as the Pearson curve. Now how does Mr Yuan illustrate the supposed superiority of fit of
the logarithmic curve over a Pearson curve?

He compares the logarithmic curve with a Pearson Type III curve! Why not straight away
compare it with a rectangle or a normal curve? The logarithmic curve lies in Type VI area and
is nearer to a Type V than a Type III distribution.

For a practical illustration he takes the distribution for the weights of 1000 female students
as follows:

Frequency, in order of increasing central weight in lbs.:

2, 16, 82, 231, 248, 196, 122, 63, 23, 5, 7, 1, 2, 1, 1.   Total 1000.


* See for example: Paul R. Rider, Biometrika, Vol. xxiv. p. 386, where no attention has been paid
to the "abruptness" or to the limitation of the range. Or again: G. L. Edgett, Metron, Vol. ix. No. 2,
pp. 31-82, who applies a wrong type (using a method similar to that suggested by me in 1895 and then
found lacking in accuracy) and then asserts that this type is a bad fit. But illustrations, especially
in practical memoirs, are really too frequent to be recorded here.

† Published in the Annals of Mathematical Statistics, Vol. iv. pp. 30-74. Edwards Brothers, Ann
Arbor, Mich.

‡ Phil. Trans. Vol. 186 A, p. 370.

§ Biometrika, Vol. iv. p. 186.

‖ Loc. cit. p. 42.

¶ Biometrika, Vol. xxii. p. 147.






[Figure: Diagram Showing the Relation Between $\beta_1$ and $\beta_2$
for the Logarithmically Transformed Normal Curve.]



The constants of this distribution are:

Mean = 118.24 lbs., Mode = 112.68 lbs.

$\sigma = 1.691,762$*, $\beta_1 = .963,403$, $\beta_2 = 5.463,569$.
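How far this point lies from the (L) line can be checked numerically. The sketch below is an illustration only: it assumes the standard moment relations of the logarithmically transformed normal curve, $\beta_1 = (\omega-1)(\omega+2)^2$ and $\beta_2 = \omega^4 + 2\omega^3 + 3\omega^2 - 3$ with $\omega = e^{\sigma^2}$, which are not quoted in this note; the function names are hypothetical. Solving the first relation for the observed $\beta_1$ gives the only $\beta_2$ the logarithmic curve can supply.

```python
def lognormal_beta1(w):
    """beta_1 of the logarithmic curve as a function of w = exp(sigma^2)."""
    return (w - 1.0) * (w + 2.0) ** 2


def lognormal_beta2(w):
    """beta_2 of the logarithmic curve as a function of w = exp(sigma^2)."""
    return w ** 4 + 2.0 * w ** 3 + 3.0 * w ** 2 - 3.0


def solve_w(beta1, lo=1.0, hi=10.0, steps=100):
    """Bisection for w on [lo, hi]; lognormal_beta1 increases with w."""
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        if lognormal_beta1(mid) < beta1:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    w = solve_w(0.963403)
    print(w, lognormal_beta2(w), 5.463569)
```

On these assumptions the logarithmic curve gives $\beta_2$ of about 4.76 against the observed 5.46.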

A glance at Pretorius' diagram shows us that the point ($\beta_1$, $\beta_2$) is far away from the (L) line,
and accordingly a logarithmic curve would give an entirely erroneous $\beta_2$ and an incorrect fourth

* In working units, = 16.91762 lbs.





moment coefficient. The Pearson curve appropriate to the data is Type IV, and the corresponding
equation is

1 2*1 12,672e +9#076 » 3G7 


y 


( i+ (3-866,7376) ) 


8\tt*12»,87» 


in working units, 


with origin at 3.411,005 working units, i.e. 34.11005 lbs. before the mean. The mode is at
2.854,650 working units from the origin.


The areas of this curve were calculated and the following results reached : 


Observed 

Frequency 

Logarithmic* 
Curve Areas 

Pearson’s 

Type IV Curve 

82 

231 

248 

196 

122 

63 

23 

?}» 

!)• 

1S} 10 

97 

228 

255 

190 

114 

57 

27 

> 

!)• 

iS } 16,087 

88*175 

216*122 

262*893 

199*009 

113*705 

55*934 

25*701 

ll*562\ w ..* fiQ 

5-227f 1<> -' 80 

1-137} 

* 553 l 4-685 
| *588 I 


For 10 groups: logarithmic curve, $\chi^2 = 13.1764$, $P(\chi^2) = .156$; Type IV curve, $\chi^2 = 5.5761$,
$P(\chi^2) = .780$.

It is clear that the fit of the Type IV curve, which provides the correct $\beta_2$, is superior to that
of the logarithmic curve, as it naturally should be. Mr Yuan may of course graduate any data
with a logarithmic curve if he considers the fit good enough for his purpose, but it is idle and
illogical to pick out a wrong type from my system, and then magnify the value of the logarithmic
curve at the expense of that system.


* Values on p. 50 of Mr Yuan's paper.


K. P. 






"Biometrika" Portrait Series, No. X. Issued as frontispiece to Vol. XXV.


Volume XXV DECEMBER, 1933 Parts III and IV 


BIOMETRIKA 

THE CRANIAL COORDINATOGRAPH, THE STANDARD 
PLANES OF THE SKULL, AND THE VALUE OF 
CARTESIAN GEOMETRY TO THE CRANIOLOGIST, 
WITH SOME ILLUSTRATIONS OF THE USES OF 
THE NEW METHOD*. 

By KARL PEARSON. 

1. Introductory . 

Many years ago when I had more leisure to give to craniometry than I have
had recently, I became convinced of the unsatisfactory nature of the "standard
planes" of the skull, and the need for a careful revision of the whole subject. In
particular, when one had realised the absolute asymmetry of the skull in all parts
and in all directions, it seemed irrational to suppose that the auricular axis was
likely to lie in a "horizontal" plane. Assuming for a moment that there is such an
entity as a "median sagittal" plane, or plane with regard to which the absolutely
symmetrical skull should have mirror symmetry, then in the natural skull the ears
would be shifted right and left, forward and backward, up and down with regard to
this mirror plane of symmetry, and accordingly the auricular axis would make an
angle with the mirror plane of symmetry. Further, all the fundamental points of
the skull which should lie in the mirror plane, i.e. the "mid-sagittal" points, would
in an actual skull be dispersed some to the right and some to the left of it, and
the fiction of a median sagittal plane as a true standard vertical plane of the skull,
seemed to vanish with the asymmetry of the skull.

Convinced that, with an asymmetrical system like the skull, the main object must
still be to start from a fitly chosen median sagittal section as the standard median
vertical plane, and that the standard horizontal plane must be perpendicular to this,
I could only look upon the Frankfurt Horizontal Plane and the Transverse Vertical
Plane, both passing through the auricular axis, and the Median Sagittal Plane (as
determined by any three points†) as very temporary and inadequate expedients to
obtain three mutually perpendicular standard cranial planes. These customary planes
are not mutually rectangular. By construction the Frankfurt Horizontal Plane and
the usual Transverse Vertical Plane are at right angles. Hence the third plane
ought to be at right angles to both these planes, that is to say at right angles to
the auricular axis. It is perfectly easy to draw the curve of intersection of such a

* The material points of this paper were given as a lecture before the Oxford University Anthropo-
logical Society on May 25, 1933.

† These three points are assumed to be in the “mirror plane of symmetry,” for example Nasion,
Bregma and Lambda as used in the Biometric Laboratory, or Nasion, Inion and Basion as suggested by
Martin. See p. 227 below.


plane with the external surface of the skull, if we have an instrument for setting 
the skull with its auricular axis perpendicular to the plane of the craniometric 
table. That is possible by aid of the cranial coordinatograph. It only remains to 
settle through what point this plane perpendicular to the auricular axis shall be 
taken. If the skull had complete mirror symmetry then this plane should bisect the 
auricular axis, i.e. pass through the point — the Mid-porion — midway between the 
Right and Left Poria. In the random selection of crania I have used to illustrate 
this paper, this Mid-porion plane deviates so widely from any supposed median 
sagittal section, that no one would think of using it. By aid of my coordinatograph, 
Dr Morant kindly drew for me on a Hindu skull the Frankfurt Horizontal Plane*, 
the usual Transverse Vertical Plane, as determined by placing the skull on a Ranke 
craniophor, and the Mid-porion perpendicular plane. Plates I — III indicate the 
absurd results thus reached. The absurdity lies in the fact that in no ordinary 
skull is the auricular axis perpendicular to any reasonable Median Sagittal Plane, 
and the sooner we, as craniologists, realise this the better. The auricular axis makes 
an angle differing from a right angle with the Median Sagittal Plane and has no 
real claim to be selected as a “horizontal line.” A very brief experience will con- 
vince an attentive observer that when the subject is holding his head “straight” 
the two ears are not usually on the same level, to say nothing of their equality 
in distance from any mid-line of the face†.

I have, perhaps, said enough to convince the reader that the fundamental crux 
of the determination of the standard planes of the skull lies in the discovery of an 
adequately satisfactory “median sagittal section,” i.e. a standard vertical sagittal 
section or an approximate mirror plane. This must precede the determination of a 
standard horizontal plane, if only for the strong reason that with a truly symmetrical 
skull we can find twelve or more points which lie in the mirror plane, while the 
“horizontal plane” has only some four points for its determination, and these in 
no case so simply determinable as the positions of those we have spoken of as 
mid-sagittal points. The “crux” lies in this: we have twelve or more points which 
“should” lie in one plane. If they don’t, what is an adequate representative of that 
plane? The mathematician would answer at once that a “good” substitute for it 
would be the plane that made the sum of the squares of the distances from it of 
these mid-sagittal points a minimum‡. At the present stage of these investigations
“weighting” the mid-sagittal points must be left on one side, and we solve our 

* Not determined from the left Orbitale, but from the mean height of right and left Orbitalia. 

† In very marked cases the unequal height of the ears is recorded as an “anatomical anomaly,”
see Annals of Eugenics, Vol. IV. pp. 235–6, Plates III and IV.

‡ Not infrequently such a plane is termed the “best” plane. But to accept this view we should first
have to show that the deviations of these points from this plane followed the normal law of frequency,
and that the standard errors of the deviations of the individual points were the same, otherwise the
question of weighting the individual points arises. This suggests valuable, but very laborious work in
determining the variations of what I have termed the mid-sagittal standard points from a mid-sagittal
plane and a consideration of their individual frequency distributions. We should probably find ourselves
finally thrust on the problem of whether the standard deviations varied from race to race. If they should
do so, weighting would be very troublesome.




problem by stating that the standard mid-sagittal plane is the plane— in a mathe- 
matical sense— of close fit to the twelve or more mid-sagittal points. 

It was with regard to problems of this kind, in particular cranial problems, that 
in 1901 I published a paper on “Lines and Planes of Closest Fit to Systems of 
Points in Space*,” and showed that the determination of good fitting planes 
depended on finding the standard deviations and correlation coefficients of the 
coordinates of the points in space. The solution was reduced to a problem in solid 
Cartesian geometry. To apply it to the skull we must (i) determine some three 
rectangular planes associated with the skull — these I term the fundamental reference 
planes, and (ii) have some instrument which will rapidly provide us with the three 
coordinates of any point whatever of the skull in relation to these three planes. 
Such an instrument I term a cranial coordinatograph. One especially designed by 
me and made for me by Messrs Hawksley and Sons will be described below.
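The fitting just described can be sketched numerically. The Python fragment below is a minimal illustration and not a transcription of the 1901 paper: it assumes that the plane of closest fit passes through the centroid of the points and that its normal is the direction of least variance of the coordinates, that is, the eigenvector of their covariance matrix with the smallest eigenvalue. The eigenvectors are found here by Jacobi rotations, a technique swapped in for convenience. All function names and the sample coordinates are hypothetical.

```python
import math

def centroid(points):
    """Mean of each coordinate over all points."""
    n = len(points)
    return [sum(p[i] for p in points) / n for i in range(3)]

def covariance(points, centre):
    """3 x 3 matrix of mean products of deviations from the centroid."""
    n = len(points)
    return [[sum((p[i] - centre[i]) * (p[j] - centre[j]) for p in points) / n
             for j in range(3)] for i in range(3)]

def jacobi_eigen(matrix, sweeps=50):
    """Eigenvalues and eigenvectors (columns of v) of a symmetric 3 x 3 matrix."""
    a = [row[:] for row in matrix]
    v = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]
    for _ in range(sweeps):
        for p, q in ((0, 1), (0, 2), (1, 2)):
            if abs(a[p][q]) < 1e-15:
                continue
            diff = a[q][q] - a[p][p]
            if diff == 0.0:
                theta = math.copysign(math.pi / 4, a[p][q])
            else:
                theta = 0.5 * math.atan(2.0 * a[p][q] / diff)
            c, s = math.cos(theta), math.sin(theta)
            for k in range(3):  # a <- a J (columns p and q)
                akp, akq = a[k][p], a[k][q]
                a[k][p], a[k][q] = c * akp - s * akq, s * akp + c * akq
            for k in range(3):  # a <- J^T a (rows p and q)
                apk, aqk = a[p][k], a[q][k]
                a[p][k], a[q][k] = c * apk - s * aqk, s * apk + c * aqk
            for k in range(3):  # accumulate the rotations in v
                vkp, vkq = v[k][p], v[k][q]
                v[k][p], v[k][q] = c * vkp - s * vkq, s * vkp + c * vkq
    return [a[i][i] for i in range(3)], v

def closest_fit_plane(points):
    """Return (centroid, unit normal) of the plane of closest fit."""
    centre = centroid(points)
    values, vectors = jacobi_eigen(covariance(points, centre))
    k = values.index(min(values))          # direction of least variance
    normal = [vectors[i][k] for i in range(3)]
    return centre, normal

def deviations(points, centre, normal):
    """Signed perpendicular distance of each point from the plane."""
    return [sum((p[i] - centre[i]) * normal[i] for i in range(3)) for p in points]

# Hypothetical coordinates (mm), invented for illustration only.
sample = [(-60.0, 80.0, 96.0), (-95.0, 40.0, 95.0), (-110.0, 0.0, 97.5),
          (-70.0, -70.0, 96.5), (0.0, -85.0, 95.5), (35.0, 5.0, 96.0)]
centre, normal = closest_fit_plane(sample)
residuals = deviations(sample, centre, normal)
```

The mean square of `residuals` is then the quantity that the fitted plane makes a minimum; no weighting of the individual points is attempted in this sketch.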

Now let us see where we stand. The skull may be looked upon as a system of 
indefinitely numerous points. By aid of the cranial coordinatograph we can at once 
form tables of the coordinates in space of any number of these points we please. The 
instrument enables us to construct plan and elevation drawings of these points. 
We can then proceed to deduce properties of the skull either by the methods of 
solid Cartesian geometry so familiar to the mathematician, or by the graphical 
rules of plan and elevation drawings so familiar to the engineer. 

We are thus able to determine (i) the distance between any two points on the 
skull — the callipers may be dispensed with, although the tape will still be required; 
(ii) the equation to the line joining any two points and the angle made by this line 
with any other line or plane; (iii) the angle between any two planes as represented 
by their equations; (iv) the standard mid-sagittal plane as defined above, and — 
perhaps the most important of all determinations — whether (v) any two homologous 
points on the skull have true mirror symmetry. The cranial coordinatograph seems 
to me to throw open a new field in craniometry, much as the modern theory of
statistics did some forty years ago. It adds solid analytical geometry to the
technique of the craniologist, and provides a valuable addition to his instrumentarium.
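The determinations (i), (ii), (iii) and (v) listed above reduce to a few lines of coordinate geometry. The following Python sketch is illustrative only; the function names are hypothetical, planes are assumed to be given by a point on them and a unit normal, and the test of mirror symmetry is taken here, as an assumption, to be the distance between one point and the mirror image of its homologue.

```python
import math

def sub(p, q):
    return tuple(a - b for a, b in zip(p, q))

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def norm(u):
    return math.sqrt(dot(u, u))

def distance(p, q):
    """(i) Straight-line distance between two cranial points."""
    return norm(sub(p, q))

def direction_cosines(p, q):
    """(ii) Direction cosines of the line joining p to q."""
    d = sub(q, p)
    n = norm(d)
    return tuple(c / n for c in d)

def angle_between_lines(d1, d2):
    """Acute angle in degrees between two directions (or two plane normals)."""
    cosine = abs(dot(d1, d2)) / (norm(d1) * norm(d2))
    return math.degrees(math.acos(min(1.0, cosine)))

def angle_line_plane(direction, normal):
    """(ii) Angle in degrees between a line and a plane with the given normal."""
    return 90.0 - angle_between_lines(direction, normal)

def angle_between_planes(n1, n2):
    """(iii) Angle in degrees between two planes, from their normals."""
    return angle_between_lines(n1, n2)

def mirror_image(p, point_on_plane, unit_normal):
    """Reflection of p in the plane through point_on_plane with unit normal."""
    h = dot(sub(p, point_on_plane), unit_normal)
    return tuple(a - 2.0 * h * n for a, n in zip(p, unit_normal))

def asymmetry(left, right, point_on_plane, unit_normal):
    """(v) Distance between the right point and the mirror image of the left."""
    return distance(mirror_image(left, point_on_plane, unit_normal), right)
```

Two homologous points have true mirror symmetry with regard to a given plane when `asymmetry` returns zero; any other value measures, in millimetres, how far they fall short of it.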

I have no desire to screen the labour of calculation involved in the new processes 
suggested in this paper. The question is: Are the results worth that labour? 
Personally I think they are. It is something to gain a means whereby we can 
distinguish between the adequacy of various median sagittal planes, or obtain a 
measure of the inadequacy of the Frankfurt Horizontal Plane. And these are only 
two of the many problems solvable by the new method†.

* Philosophical Magazine, November, 1901, pp. 559–572.

† Apart from new problems we obtain other methods of solving old problems. We have the co-
ordinates of Nasion, Alveolar Point and Basion; they give the equations to the sides of the fundamental
facial triangle; and we can study its angular properties without using the trigonometer. Or again we
have the coordinates of Nasion and Prosthion (or, if we prefer, Alveolar Point), and have the equation to
their join. One of the direction cosines of this line gives the angle it makes with the Frankfurt Plane,
that is the Profile Angle. We can thus dispense with the goniometer, which is at best a faulty instru-
ment for it assumes that the line joining Nasion and Prosthion projects into a vertical line on the
usual Transverse Vertical Plane, which in general it does not.
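The two determinations mentioned in this note may be sketched as follows. The fragment assumes that the coordinates are referred to the planes of reference of this paper, so that the Frankfurt Plane is the plane x = 0 and the angle of a line with it is the arc-sine of the x direction cosine. The function names are hypothetical.

```python
import math

def vertex_angle(a, b, c):
    """Angle in degrees at vertex a of the triangle a, b, c."""
    u = [b[i] - a[i] for i in range(3)]
    v = [c[i] - a[i] for i in range(3)]
    lu = math.sqrt(sum(x * x for x in u))
    lv = math.sqrt(sum(x * x for x in v))
    cosine = sum(x * y for x, y in zip(u, v)) / (lu * lv)
    return math.degrees(math.acos(max(-1.0, min(1.0, cosine))))

def facial_triangle_angles(nasion, alveolar_point, basion):
    """Angles of the facial triangle at Nasion, Alveolar Point and Basion."""
    return (vertex_angle(nasion, alveolar_point, basion),
            vertex_angle(alveolar_point, basion, nasion),
            vertex_angle(basion, nasion, alveolar_point))

def profile_angle(nasion, prosthion):
    """Angle in degrees between the Nasion-Prosthion line and the plane x = 0."""
    d = [prosthion[i] - nasion[i] for i in range(3)]
    length = math.sqrt(sum(x * x for x in d))
    return math.degrees(math.asin(min(1.0, abs(d[0]) / length)))
```

Because the full direction of the line is used, no assumption is made that it projects into a vertical on the Transverse Vertical Plane.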



Before we can determine the coordinates of points on the skull we must fix on 
coordinate planes. These might of course be selected in any manner, but it is 
convenient to select for them already familiar planes. I call them the planes of 
reference, and the reader must distinguish them from the standard planes of the 
skull, which are something quite different. I take as planes of reference : (i) the 
Frankfurt Horizontal Plane as determined by balancing the Poria on the extreme 
points of the knife edges of a Ranke craniophor. The Orbitalia are then marked on 
right and left orbits and the skull rotated until the horizontal plane through the 
auricular axis (i.e. the join of the Poria* or the knife edge of the ear plugs) bisects 
the difference of the heights of the two Orbitalia†. This is our first reference plane.
With the skull thus adjusted and the scriber at this height, the most posterior 
point in the occipital region in this plane (to be called the plane of x = 0) is marked 
on the skull. It will be called Kappa. This point Kappa is usually on or close 
to the occipital protuberance. The Frankfurt Horizontal Plane as thus defined 
will pass either through both Orbitalia or above one lower orbital margin and 
below the other. See Plate I (a). (ii) While the skull is still adjusted to this
slight modification of the usual Frankfurt Plane, the horizontal bar of the 
craniophor is brought down until it is in contact with the skull and moved forward 
or backward until its point meets the sagittal suture. This point is marked on the 
skull. It is the Apex in my terminology‡. The plane through the Apex and the
Poria is that of the so-called Transverse Vertical Plane, the plane for which the 
Transverse Contour is usually provided. It is to be our plane of (xz) or y = 0. 
The planes (i) and (ii) meet in the auricular axis which is accordingly our axis of z.
(iii) Any plane perpendicular to the auricular axis will serve as the third plane 
of reference. Now suppose the skull removed from the craniophor, and adjusted so 
that the auricular axis is perpendicular to a drawing-board, then the paper on that 
drawing-board may be taken as the plane of (xy) or z = 0. The point in which the
auricular axis meets this drawing-board will be our origin, or the origin is the plan 
of the two Poria. If we now project onto the drawing-board all points we please of 
the skull, we shall have their plans on the plane of z = 0. These will give us their 
x and y coordinates. The plan of the Apex joined to the plan of the Poria will 
give the axis of x, and the line in this plane through the plan of the Poria
perpendicular to the join of the plans of Apex and Poria will be the axis of y.
This axis of y should very closely pass through the plan of Kappa if our adjustments 
have been accurately made. I take the positive direction of the axis of x to be away 

* I prefer to mark the Poria only after the skull is supported on the knife edges. 

† The scriber set to the level of the left of the knife edges is first applied to one Orbitale, and the
skull rotated until this Orbitale and the Poria are in one plane. The scriber is then applied to the second
orbit; if its Orbitale is above the scriber, the scriber point is marked on the skull below the Orbitale, and
the skull rotated till the scriber bisects the difference. If the scriber as first set is above the second
Orbitale, then that Orbitale is brought up to the scriber, which when applied to the first orbit will now be
below its Orbitale and the bisection must take place on that orbit. The ultimate plane, to be called that
of (yz) or x = 0, is our first plane of reference.

‡ The Apex must not be confused with the Vertex; the latter is the point of the skull at maximum
perpendicular distance from the Frankfurt Horizontal Plane.





from the Apex, i.e. towards the base of the skull, and the positive direction of the 
axis of y away from Kappa or towards the face of the skull. I have chosen the 
plane of y = 0, or the Transverse Vertical Plane, as the plane for giving the elevations.
The Frankfurt Horizontal Plane is the plane through the axis of y perpendicular to 
the plane of the drawing board, and, if the skull were truly symmetrical, the plane 
of the plans would be parallel to the mirror plane or a true Median Sagittal Plane. 
Accordingly all the mid-sagittal points would have the same elevations. 
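The passage from the drawing-board to the three coordinates can be sketched as follows. This is a minimal illustration under stated assumptions: the plans are supposed to have been measured as (u, v) positions in millimetres in some arbitrary frame on the board, the elevation is the scale reading above the board, and the function names and sample readings are hypothetical. The axis of x is directed away from the plan of the Apex and the axis of y away from the plan of Kappa, as laid down above.

```python
import math

def skull_coordinates(plan, elevation, origin_plan, apex_plan, kappa_plan):
    """Turn a pricked plan (u, v) and a scale reading into (x, y, z)."""
    # +x runs from the plan of the Apex through the origin, i.e. away from the Apex.
    ax = (origin_plan[0] - apex_plan[0], origin_plan[1] - apex_plan[1])
    length = math.hypot(ax[0], ax[1])
    ex = (ax[0] / length, ax[1] / length)
    # +y is perpendicular to +x; of the two candidates take the one away from Kappa.
    ey = (-ex[1], ex[0])
    k = (kappa_plan[0] - origin_plan[0], kappa_plan[1] - origin_plan[1])
    if k[0] * ey[0] + k[1] * ey[1] > 0:
        ey = (-ey[0], -ey[1])
    d = (plan[0] - origin_plan[0], plan[1] - origin_plan[1])
    x = d[0] * ex[0] + d[1] * ex[1]
    y = d[0] * ey[0] + d[1] * ey[1]
    return (x, y, elevation)

def elevation_spread(points):
    """Range of the z coordinates; zero if all points lie in one plane z = const."""
    zs = [p[2] for p in points]
    return max(zs) - min(zs)

# Hypothetical board readings (mm), invented for illustration only.
origin, apex, kappa = (100.0, 100.0), (100.0, 200.0), (20.0, 100.0)
nasion = skull_coordinates((190.0, 130.0), 96.5, origin, apex, kappa)
```

For a truly symmetrical skull `elevation_spread` applied to the mid-sagittal points would vanish; its size in an actual skull gives a first rough indication of how far those points depart from a single plane perpendicular to the auricular axis.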

Plates IV — VI provide photographs of the plan and elevation drawings of six 
skulls — those of a Fuegian, an ancient Egyptian from Nubia, a modern Arab, a 
Negro from the Teita Hills district, a 17th century Londoner, and a modern Hindu. 
These skulls were chosen at random, and the diagrams in every case show us that the 
mid-sagittal points do not lie in a single plane perpendicular to the auricular axis, 
for if they did these points would have equal elevations*. A true mid-sagittal plane 
is a fiction in every one of these cases, and we are compelled to replace it by the 
idea of the “closest fitting” plane to the chief mid-sagittal points.

2. The Cranial Coordinatograph.

In order to obtain the plans and elevations of points on the skull (not confining 
ourselves as in our present illustrations to the mid-sagittal points) it is needful to 
devise an instrument which will serve three purposes : 

(a) Set the line joining any two selected points of the skull perpendicular to 
a drawing board. In our present illustration this line is the auricular axis or join 
of the Poria, but we might find in other investigations that some other line would 
be of more service as axis of z. This can equally well be achieved by the coordinato- 
graph. 

(b) The instrument must be capable of measuring the height of any point 
above the drawing-board, that is we must read off easily upon it the elevations of 
any chosen cranial points. 

(c) At the same time it must by a simple action record on the drawing-board 
the plan of the cranial point. These objects are all achieved by the present 
instrument. 

Diagrammatically the instrument consists of three arms, two of which are capable 
of fine motion and to each of which a vernier is attached. See Fig. 1 on p. 222. 

AB is a vertical rod on which slide the two arms D₁P₁ and D₂P₂, which can be
brought into absolute contact such that the points P₁ and P₂ coalesce. CD₃ is
a fixed arm, and carries a vertical cylinder in the axis of which is a needle point P₃.
When the button at C is pressed, this needle comes down on the drawing-board and 
the inked rim of the cylinder makes a circle with a needle point in its centre. This 

* The models based upon the plan and elevation drawings provided by the cranial ooordinatograph 
had to be tilted for photography in order that the names of the points should be legible. Accordingly 
the reader will find it best when examining them to hold the page nearly vertical, when the models 
come more closely into correct perspective. 




is the plan of the point, with which one or other of the points P₁ or P₂ is placed in
contact. The three points P₁, P₂ and P₃ are in a vertical line perpendicular to the
base BB′ of the instrument. Thus the line P₁P₂P₃ is accurately parallel to the
vertical rod AB. This rod is graduated in millimetres from zero at P. Thus with 
the fine adjustments and verniers the height of P₁ or P₂ or their difference in height
can be determined with close approximation. The other part of the instrument, 
the cranial staddle or skull trivet — it would be misleading or ambitious to call 
it a craniophor — consists of a triangular plate carrying a saucer, and supported 
by three screws at the angles. The skull is placed on a bed of plasticine in the 
saucer, with the required line roughly adjusted to the vertical. The point P₂ of the



Fig. 1.


coordinatograph is brought into contact with the lower point of the line; the upper
point P₁ is brought down to the approximate level of the upper point. Then by the
turning of the three staddle screws and the fine adjustment movements of the arms
D₁P₁ and D₂P₂, the skull is brought into contact at the given points (e.g. the Poria)
with the terminals P₁ and P₂ of the arms of the coordinatograph. The adjustment
does not take in practice much longer to carry out than it does to describe, and when
it is completed the required line (e.g. the auricular axis) is perpendicular to the
drawing-board; a touch on the button at C now records the position of the origin on
the drawing-board. The upper arm is then raised above the skull out of the way, and
the lower arm used; it is brought into contact successively with the points of the
skull which are to have their coordinates found. The button C determines the plan,
the scale on the vertical AB, with the vernier at D₂, gives the elevation of the point.
The plan of the Apex gives the x-axis, and the line through the origin perpendicular
to this gives the y-axis. Thus the three coordinates of any point can be found from
the recorded elevations and the measurement of the plan drawing on the board.





The coordinatograph and the cranial staddle are made relatively heavy, so that they
may not be moved too quickly either by over-haste or accident, and thus compel the 
user to readjust the skull and start afresh. Care has to be taken in adjusting the 
skull on its bed, that no required point lies immediately above or very close to one 
of the screw legs of the staddle. The base plate of the staddle is raised sufficiently 
above the drawing-board to allow of the plan-recorder D₃C passing beneath it as
the plans of one or two points of the skull will frequently be beneath this plate. 
With practice in the use of the apparatus I think the adjustment of a skull and the 
determination of the three coordinates of some twenty points do not involve much 
more than an hour’s labour. 

Table I gives as an illustration the three coordinates of some fourteen points on 
six skulls referred to the reference planes we have discussed above. After a short 
discussion of mirror symmetry, I shall return to these sets of coordinates and indicate 
the type of problem in which their determination can be of service. 

Plate VII (a) gives a photograph of the Cranial Coordinatograph in its first form. 
Slightly to the right we see a skull on the cranial staddle ; to the left of it the two 
scriber arms of the coordinatograph are adjusted to the upper and lower Poria of the 
skull. The scale, the fine adjusting mechanisms of the scriber arms, with the verniers 
on the sloping faces of the cut away portions of the arm brackets are visible. Two 
instruments for recording plans are also in the picture. That on the right is of the 
Klaatsch type*, and is set for determining the plan of the glabella of the skull. That 
on the left is the plan-pricker from my osteometric instrument. The upper is the 
scriber arm, the lower arm marks the plan of its point by a circle (inked with a pad) 
with central needle point, corresponding to the cylinder-bearing arm of the diagram 
in Fig. 1. The use of such auxiliary instruments requires a double operation, the 
measurement of the elevation by the coordinatograph and the location of the plan by 
some form of vertical projector. This double instrumental setting has been got rid of 
by attaching an arm like that on my plan-pricker on the left to the base of the 
coordinatograph: see Plate VII (b). Thus, if either arm be set to a point on the
adjusted skull, a tap on the plan-pricker gives the plan, and a reading of the scale 
on the vertical upright of the coordinatograph the elevation of the point in question. 

3. On the Symmetry of the Skull. 

I have indicated in my introductory remarks that I do not look upon the 
Frankfurt Horizontal Plane passing through the auricular axis as the best approach 
we can make to one of the fundamental planes of the skull. The first thing which 
strikes an observer of the human frame is its superficial approach to symmetry. 
This symmetry is not axial, but planar or mirror symmetry. Take any body whatever 
with a plane side and place that side against a mirror, and we see at once an example 

* This does not differ essentially from Lissauer’s Diagraph (Archiv für Anthropologie, Bd. xv.
Supplement, S. 15 and Tafel XIV), but I was personally interested in Klaatsch’s contour-tracings (of
which I possess several originals), and I had the original projector used in the Biometric Laboratory
made for me to his pattern by his instrument maker. I therefore speak of it as of “Klaatsch type”
without claiming for him, or indeed for Lissauer, the invention of craniographs.



[Table I. Coordinates x, y, z of some fourteen cranial points on each of six skulls, (A)–(F), referred to the planes of reference. The column headings that survive are (A) Fuegian, (B) Nubian, (D) Negro and English. Numerical entries not recoverable.]

The Inferior Inion was not used in the case of (A) and (B). In the case of (A) — (F) the Kappa was so close to the x-axis that I have taken it on that axis. 











of such mirror symmetry*. This apparent symmetry is the striking feature of the
living head or the skull, and it is idle to neglect it and suppose we can determine
any standard plane without regard to it. Of course as soon as we begin to measure
we find that the skull is very far from symmetrical. Much work has been devoted 
in the Biometric Laboratory in recent years to the question of asymmetry in the 
human frame, and more will shortly be published. I do not propose to deal at 
present with the results of those investigations, although they actually upset some 
current beliefs. As far as the skull is concerned those investigations deal with the 
measurement of homologous distances or the size and shape of homologous bones. 
But we have to remember that the brain controls the development of the brain case 
as much or rather more than the case controls the brain. Let us start with a 
hypothetical brain of perfect mirror symmetry, and let it retain this symmetry from 
the earliest fetal life. The homologous bones will spread from their ossification 
centres over the brain but they will spread unequally and not homologously; the
resulting brain case might possibly have mirror symmetry, but this could not be 
ascertained in general from a measurement of homologous bones. We need to 
ascertain first at least some approach to a probable mirror plane. If we suppose 
homologous bones to grow “at random,” according to a general law, but that there 
be no absolute equality of growth in homologous bones in definite directions, then 
they will meet and the sutures be formed in a more or less random manner, first 
the edge of a right side, then of a left side bone protruding into the territory of the 
other. The best we can do is to take some form of average of the sutures which 
should lie in the mirror plane. As we cannot attempt this for every point on the 
whole series of sutures, we do the best we can by fitting a close plane to a reasonably
large number of definite points on these sutures. If this gives us the best plane 
available for the skull, it by no means follows that it would be with equal closeness 
the mirror plane of the brain or of the living head. Indeed the brain might be truly 
symmetrical, while the skull was asymmetrical. Without having definite evidence 
to produce, I think, however, that the living head is on the average more asym- 
metrical than the skull. Suppose we take a full-face portrait of a head, how shall 
we determine what is its mirror plane, and judge what it would look like if 
symmetrical ? 

I tested in the first place a rough outline sketch of Cromwell's death mask — 
Plate VIII (ii) (b). The nose decidedly deviated to the left cheek. What is the
“best” mirror plane? The only thing to be done was to bisect the lip line and 
the external ocular distance; these points of bisection were joined, and the
drawing cut in half down the bisecting line. The two halves were traced in 
reverse and then the two right sides were joined up and the two left sides were 
joined up to form absolutely symmetrical faces. The results in Plate VIII (ii) (a) 
and (c) are absurd, but suggestive. The reader can choose (ii) (a) with the 
duplication of the famous wart over the right eye, or (ii) (c) without the wart at all. 
We see at once that the skewness of the nose leads to a marked diminution of that 

* The mirror must be a silvered plate, and not the ordinary glass mirror, for the latter will show a
vacant sheet between object and image. 




organ, or to its exaggeration. In Plate VIII (i) I have applied the same treatment 
to a drawing of the Ashmolean bust of Cromwell. 
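The construction of a symmetrical face from one half of an outline may be sketched as a reflection in the bisecting line. The fragment below is an illustration only: the outline is assumed to be given as a list of (u, v) points on one side of the line, and the function names and the reduction of the drawing to points are hypothetical.

```python
def midpoint(p, q):
    """Point of bisection of the segment p q."""
    return ((p[0] + q[0]) / 2.0, (p[1] + q[1]) / 2.0)

def reflect(point, a, b):
    """Mirror image of a 2D point in the line through a and b."""
    dx, dy = b[0] - a[0], b[1] - a[1]
    t = ((point[0] - a[0]) * dx + (point[1] - a[1]) * dy) / (dx * dx + dy * dy)
    foot = (a[0] + t * dx, a[1] + t * dy)   # foot of the perpendicular
    return (2.0 * foot[0] - point[0], 2.0 * foot[1] - point[1])

def symmetrise(half_outline, lip_ends, eye_ends):
    """Join one half of an outline to its own reflection in the bisecting line.

    The line passes through the bisection of the lip line and the bisection
    of the external ocular distance.
    """
    a = midpoint(lip_ends[0], lip_ends[1])
    b = midpoint(eye_ends[0], eye_ends[1])
    mirrored = [reflect(p, a, b) for p in reversed(half_outline)]
    return list(half_outline) + mirrored
```

Applying `symmetrise` to the right half and then to the left half of the same outline gives the two symmetrical faces; the bisecting line need not be vertical on the paper.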

Now this difficulty follows us when we ask what would a familiar face look like 
were it symmetrical. We take a full-face portrait and at once are met by the 
problem: Where is the mirror plane to be placed? I took the photograph of a 
colleague and after several trials finally settled that again it was best to take the 
dichotomic line through the bisections of mouth and external orbital distance. 
This line would not be a truly vertical line as the head was slightly inclined to the 
right: see Plate IX (b). To my horror the bulk of the nose and neck fell in one half
of the divided photograph. These two halves had now to be reversed, and this 
was done by rephotographing them, and obtaining two prints, one with the film 
against the printing paper and the other with the glass. Joining my two halves 
together I obtained two perfectly symmetrical faces — Plate IX (a) and (c) — but at 
the cost of much reality. Any one who examines these “symmetricised” portraits
will I think come to the conclusions that (i) grace does not necessarily connote 
facial symmetry, and (ii) a good deal of personal individuality is linked with facial 
asymmetry. 

After this I felt able to deal with the problem of a symmetrical skull. Precisely 
the same process was repeated on the Norma facialis of an Egyptian skull. See 
Plate X. Allowing for the weakening of the reversed prints, I think, any one 
accustomed to handling skulls will be struck by the naturalness and individuality 
of the central photograph as compared with the symmetricised products to right 
and left of it. 

But while it is relatively easy to create symmetrical portraits of the living face 
or of the Norma facialis of a skull, it is far less easy to determine the plane 
associated with an actual head or skull, by aid of which we can appreciate its 
asymmetry. As I have indicated in the introduction to this paper, there are a 
certain number of points which may be termed the mid-sagittal points, for they 
should lie, if the skull were symmetrical, in one plane, the mid-sagittal (and are not 
infrequently assumed to do so even in a natural skull). These points are the 
following: (i) Alveolar Point, (ii) Nasal Spine, (iii) Nasion, (iv) Glabella, (v) Bregma, 
(vi) Apex, (vii) Lambda, (viii) Kappa (see p. 220 above), (ix) Occipital Protuberance 
(Superior Inion), (x) Inferior Inion, (xi) Opisthion, (xii) Basion, and (xiii) Palatal 
Spine. As I have indicated, the plane of closest fit, as judged by minimum mean 
square deviation to these points, will be defined as the standard mid-sagittal plane.
In the past the mid-sagittal plane has been arbitrarily determined by selecting 
three of the mid-sagittal points, regardless of whether such plane was or was not at 
right-angles to other so-called standard planes. Let me illustrate the difficulties 
heretofore current by citing the weightiest authority on the subject. I am inclined
to think that in his case as in the case of many others the man who writes the 
biggest book is held to be the weightiest authority. If we cannot form an impression 
on the reader by the lucidity of our writings, we can at least impress him by the 
weightiness of our volumes. The very origin of the term weighty for an authority
may be studied from this aspect. 





Rudolf Martin in his Lehrbuch der Anthropologie in systematischer Darstellung
(note that word systematic!) tells us in Bd. II, S. 582, after emphasising the necessity
of planes of orientation for the skull, that:

“Ueber die Mediansagittal Ebene kann kein Zweifel bestehen; sie ist durch drei Punkte
(Nasion, Inion und Basion) bestimmt. Zwar liegen diese nicht immer genau in einer Ebene, aber
die Abweichungen sind so unbedeutend, dass sie in der Praxis vernachlässigt werden können.”

In translation : 

There can be no doubt about the Median Sagittal Plane. It is determined by three points 
(Nasion, Inion and Basion). These points do not lie always exactly in one plane, but their devia-
tions are so unimportant, that in practice they may be neglected. 

Why Martin selected out of the many points which should lie on the mirror 
plane the Nasion, Inion and Basion “ without doubt” as those to determine the 
Median Sagittal Plane, he does not tell us, and I cannot tell the reader. I do know 
that one of his three points is one of the most difficult to determine on the skull, 
and no two writers seem to agree on how it is to be found ! But let us go a step 
further, we are told that the Median Sagittal Plane is to be determined by these 
three points — when they are found — that seems clear enough. But alas! we are 
then informed that the plane which passes through these three points will not 
exactly pass through them, but the deviations may be neglected. Did Martin not 
know that a plane is fully determined by three points? On S. 582, this would 
appear to be so; but on S. 583, speaking of the horizontal plane of the skull, he says 
it ought to go through four points, but as the skull is asymmetrical the plane can 
only be taken through three, “for three points mathematically determine a plane.” 

In the Biometric Laboratory the Nasion, Bregma and Lambda have been taken 
for drawing the median sagittal contour, partly because these points are more or less 
clearly determined by the intersection of the cranial sutures, and partly because the 
vault of the skull seems for many purposes more important than the base. 

Now let us return for a moment to Martin. His median sagittal section is to be 
taken (notwithstanding negligible deviations) through the three points— Nasion, 
Inion and Basion. We naturally look up his definitions of these points. They are: 

S. 619. The Nasion is the meeting point of the sutura nasofrontalis (i.e. the 
suture between the nasal and frontal bones) with the Median Sagittal Plane. This 
plane therefore determines the Nasion. 

S. 615. The Inion is the point in which the Lineae nuchae superiores meet in 
the Median Sagittal Plane; if these lines are so feebly developed that they do not 
reach the Median Sagittal Plane, they must be artificially produced till they do. 
Thus again the median sagittal plane determines the Inion. 

S. 615. The Basion is the point in which the anterior border of the foramen 
magnum is met by the Median Sagittal Plane. Thus the most weighty of modern 
anthropologists defines the Median Sagittal Plane in terms of three cranial points, 
which according to him are only to be ascertained by an a priori knowledge of 
the Median Sagittal Plane. If this be “systematische Darstellung,” is there not 
some need for a little mathematical logic, a little biometry, to clear away these 
craniological fogs? 



228 


The Cranial Coordinatograph 

I have no desire to defend any particular plane which goes through three points 
as being the better representative of a plane which should pass through a dozen or 
more, but when we are told that there can be “kein Zweifel” as to what is the 
Median Sagittal Plane one is tempted to ask whether Martin's “ohne Zweifel” Plane 
is really superior to the nasion-bregma-lambda plane of the Biometric Laboratory. 
Having the coordinates of the thirteen mid-sagittal points for the six skulls selected 
at random which illustrate this paper, it was easy to write down the equations in 
those cases to Martin's Plane and the Biometric Laboratory Plane, and to measure 
(i) the angles between these planes and the plane of “closest” fit to the thirteen 
points, and (ii) the mean square residuals of the mid-sagittal points from Martin's 
“ohne Zweifel” Plane and the Biometric Plane. The results are given in Table II, 
A and B. 

TABLE II. 

A. Angles the Plane of Maximum Symmetry makes with the Biometric 
and Martin's “Median Sagittal Planes.” 

Skull                   Biometric Laboratory Plane    Martin's “ohne Zweifel” Plane 
                        (Nasion, Bregma, Lambda)      (Nasion, Inion, Basion) 

Ancient Egyptian        0° 26.6'                      1° 20.0' 
Modern Arab             1° 36.3'                      0° 55.0' 
Negro (Teita)           1° 39.5'                      5° 46.7' 
Fuegian                 1° 59.6'                      7° 10.5' 
17th century English    3° 14.0'                      0° 47.8' 
Bengal Hindu            3° 56.7'                      6° 19.2' 

Mean Angle              2°  8.8'                      3° 43.2' 

Or, Martin's Plane has on the average a 73 % increase of angular deviation on 
the Biometric Plane. 

B. Mean Square Residuals for the two “Median Sagittal Planes.” 

Skull                   Biometric Laboratory Plane    Martin's Plane 

Ancient Egyptian         1.5167                        1.5909 
Modern Arab              4.0284                        2.5998 
Negro (Teita)            5.7284                       32.0419 
Fuegian                  8.5685                       62.4137 
English                 27.3563                        4.0257 
Bengal Hindu            31.1950                       35.5764 

Mean Value              13.0055                       23.0414 

Or, Martin's Plane has on the average a 77 % increase of Mean Square Residual 
on the Biometric Plane. 
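
The averages of Table II can be checked mechanically. The following is a minimal sketch, not part of the original computation: the angles are entered as (degrees, minutes) pairs from section A, the variable names are ours, and the second percentage is taken from the two printed mean values of section B.

```python
# Angles of Table II A as (degrees, minutes), in the order of the table rows.
biometric = [(0, 26.6), (1, 36.3), (1, 39.5), (1, 59.6), (3, 14.0), (3, 56.7)]
martin = [(1, 20.0), (0, 55.0), (5, 46.7), (7, 10.5), (0, 47.8), (6, 19.2)]


def mean_minutes(angles):
    """Mean of a list of (degrees, minutes) angles, expressed in minutes of arc."""
    return sum(60 * d + m for d, m in angles) / len(angles)


mean_bio = mean_minutes(biometric)      # printed as 2 deg 8.8 min
mean_martin = mean_minutes(martin)      # printed as 3 deg 43.2 min

# Percentage excess of Martin's Plane over the Biometric Plane.
pct_angle = 100 * (mean_martin / mean_bio - 1)
# Same comparison for section B, using the two printed mean values.
pct_msr = 100 * (23.0414 / 13.0055 - 1)

print(round(mean_bio, 1), round(mean_martin, 1), round(pct_angle), round(pct_msr))
```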

The sections A and B of Table II show us that neither the Biometric Laboratory 
Plane, nor Martin’s “ohne Zweifel” Plane lies on the average very close to the 







Karl Pearson 


229 


Plane of Minimum Deviation from the mid-sagittal points. This plane may be 
spoken of as the plane of nearest approach to the mirror plane of the skull or more 
shortly as the Plane of Maximum Symmetry . This latter plane we shall take as the 
First Standard Plane of the skull, or the Standard Median Vertical Plane. Naturally 
the Standard Horizontal Plane will be perpendicular to this plane, and we shall 
determine how far the Frankfurt Horizontal Plane is deficient in this respect. The 
Standard Transverse Vertical Plane will be again perpendicular to both our Standard 
Plane of Maximum Symmetry and to our Standard Horizontal Plane, and we shall 
determine how far this plane deviates from the usual Transverse Vertical Plane 
through the auricular axis perpendicular to the Frankfurt Horizontal. It will be 
seen that whether we judge by angular deviations from the planes of maximum 
symmetry or by the mean square deviations the nasion-bregma-lambda plane is on 
our present evidence much superior to the nasion-inion-basion plane. Accordingly 
we shall not consider it needful again to refer at length to the Median Sagittal 
Plane of Martin. When we speak of the “usual” Median Sagittal Plane, we shall 
mean that in which the long series of median sagittal contours issued by the 
Biometric Laboratory has been drawn, i.e. the nasion-bregma-lambda plane. 

4. Procedure for the Determination of the First Standard Plane or Plane of 
Maximum Symmetry of the Skull. 

I have already indicated the first stage of this procedure, the determination of 
the reference planes, and in particular the auricular axis which is to be set per- 
pendicular to the plane of the drawing-board. But there is one point I should wish 
once more to emphasise. The auricular axis may be defined to be the line joining 
the extreme points of the knife edges on which the skull rests on the craniophor, 
when these knife edges are properly adjusted. This adjustment is sometimes defined 
as the process of bringing the tip of the knife edges to the Poria. But what are the 
Poria? Martin* defines the Porion “as that point on the upper border of the 
auricular passage which is vertically above the middle of the same.” But how the 
middle of the auricular passage is to be found, he does not tell us, nor can I conceive 
how it is possible for an asymmetrical conichoidal space to have a “middle.” Still 
less, if I could discover this “middle” could I take a “vertical” through it to meet 
the upper border of the auricular passage, because a vertical can only mean a line 
perpendicular to the horizontal plane, and that plane can only be found when the 
knife edge tips are already placed on the Poria. Thus according to Martin the 
Poria can only be found after the Frankfurt Horizontal Plane has already been 
determined. The very process of tilting the skull round on the knife edge tips to 
bring the Orbitalia to the height of the top of the knife edges causes the tips of the 
knife edges to slip along the upper border of the auricular passages. In my opinion 
the only way to determine the Poria is to mark them after the skull is adjusted on 
the craniophor to the Frankfurt Horizontal, the knife edges being withdrawn out- 
wards to the very verges of the upper borders of the auricular passages. If, with 
some definition, other than Martin’s, the Poria be marked before the Frankfurt Plane 


* Loc. cit. S. 618. 



230 


The Cranial Coordinatograph 

is determined, then there is difficulty about successfully balancing the skull with its 
knife edge tips on these Poria; the tilting of the skull produces a constrained 
equilibrium and the knife edge tips tend to slip off the Poria and a minor catastrophe 
may result. 

However, having marked in one way or another the Poria, it is fairly easy to 
adjust the skull by the three screws of the skull staddle so that the two Poria are 
in contact with the two upper arm tips of the coordinatograph. A touch of the button 
marks the point where the auricular axis meets the plane of the drawing-board; this 
point is the plan of the Poria and the origin of coordinates. Before the coordinatograph 
is moved, the elevations of the Poria must be read off and recorded; a check on their 
accuracy is that their difference should equal the interporial distance, which should 
be taken with the callipers when the skull has been removed from the craniophor 
and before it is placed on the staddle. The Apex and the Kappa are now projected 
on to the drawing-board by aid of the coordinatograph, and their plans, joined to 
the plan of the Poria, give the axes of x and y respectively. These axes should be 
perpendicular. If they are not, the y-axis must be taken perpendicular to the 
x-axis, and the Kappa will have an x-coordinate differing from zero. Such a 
coordinate was too small to be measurable in the skulls dealt with by me. The plan and 
elevation of every other “mid-sagittal” point is obtained in the same manner, and the 
record for each skull will be a series of coordinates similar to those given in Table I. 

The craniologist may not take it amiss if we remind him of the chief theorems 
in solid analytical geometry which are of value in the study of the skull. If 
$(a_1, b_1, c_1)$, $(a_2, b_2, c_2)$, $(a_3, b_3, c_3)$ be the coordinates of three points, and 

$$l_1 x + m_1 y + n_1 z = p_1 \quad \text{and} \quad l_2 x + m_2 y + n_2 z = p_2$$

be the equations to two planes in the prepared form, i.e. such that 

$$l_1^2 + m_1^2 + n_1^2 = 1, \qquad l_2^2 + m_2^2 + n_2^2 = 1,$$

then: 

(i) The equation to the line passing through $(a_1, b_1, c_1)$, $(a_2, b_2, c_2)$ is 

$$\frac{x - a_1}{a_2 - a_1} = \frac{y - b_1}{b_2 - b_1} = \frac{z - c_1}{c_2 - c_1} \qquad \text{(i)}.$$

(ii) The direction cosines of the line are 

$$L_{12} = (a_2 - a_1)/r_{12}, \quad M_{12} = (b_2 - b_1)/r_{12}, \quad N_{12} = (c_2 - c_1)/r_{12} \qquad \text{(ii)},$$

where $r_{12}^2 = (a_2 - a_1)^2 + (b_2 - b_1)^2 + (c_2 - c_1)^2$. 

(iii) The angle $\theta$ between two lines is given by: 

$$\cos\theta = L_{12} L_{34} + M_{12} M_{34} + N_{12} N_{34} \qquad \text{(iii)}.$$

(iv) The angle $\phi$ between two planes as above is given by 

$$\cos\phi = l_1 l_2 + m_1 m_2 + n_1 n_2 \qquad \text{(iv)}.$$

(v) The angle $\psi$ between a line and a plane is given by 

$$\sin\psi = L_{12} l_1 + M_{12} m_1 + N_{12} n_1 \qquad \text{(v)}.$$



Karl Pearson 


231 


(vi) The plane through the points is given by 

$$\begin{vmatrix} x - a_1 & y - b_1 & z - c_1 \\ a_2 - a_1 & b_2 - b_1 & c_2 - c_1 \\ a_3 - a_1 & b_3 - b_1 & c_3 - c_1 \end{vmatrix} = 0 \qquad \text{(vi)},$$

where it is simplest to express the determinant numerically before expanding it. 
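
Theorems (ii) to (v) translate directly into a few lines of code. The sketch below is only illustrative: the function names are ours, and the test line and planes are simple coordinate cases chosen so that the answers are known beforehand.

```python
import math


def direction_cosines(p, q):
    """Direction cosines of the line from point p to point q, as in theorem (ii)."""
    d = [qi - pi for pi, qi in zip(p, q)]
    r = math.sqrt(sum(c * c for c in d))
    return tuple(c / r for c in d)


def dot(u, v):
    """Sum of products of corresponding components."""
    return sum(a * b for a, b in zip(u, v))


def angle_between(u, v):
    """Angle in degrees between two lines, or two planes, given unit
    direction cosines: theorems (iii) and (iv)."""
    return math.degrees(math.acos(max(-1.0, min(1.0, dot(u, v)))))


def line_plane_angle(line, plane):
    """Angle in degrees between a line and a plane in prepared form: theorem (v)."""
    return math.degrees(math.asin(max(-1.0, min(1.0, dot(line, plane)))))


# A line in the plane z = 0, midway between the axes of x and y.
line = direction_cosines((0.0, 0.0, 0.0), (1.0, 1.0, 0.0))
plane_z = (0.0, 0.0, 1.0)   # the plane z = 0
plane_y = (0.0, 1.0, 0.0)   # the plane y = 0

angle_planes = angle_between(plane_z, plane_y)        # a right angle
angle_line_plane = line_plane_angle(line, plane_y)    # half a right angle
print(angle_planes, angle_line_plane)
```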

(vii) Let x, y, z represent the coordinates, referred to the three planes of 
reference already discussed, of any of the “mid-sagittal” points and S denote a 
summation for all these points; let $\bar{x}, \bar{y}, \bar{z}$ be the mean coordinates of all these 
points, n in number; let 

$$\sigma_x^2 = \frac{1}{n} S(x^2) - \bar{x}^2, \quad \sigma_y^2 = \frac{1}{n} S(y^2) - \bar{y}^2, \quad \sigma_z^2 = \frac{1}{n} S(z^2) - \bar{z}^2,$$

be the x, y and z squared standard deviations; and let 

$$p_{zy} = \frac{1}{n} S(zy) - \bar{z}\bar{y}, \quad p_{xz} = \frac{1}{n} S(xz) - \bar{x}\bar{z}, \quad p_{yx} = \frac{1}{n} S(yx) - \bar{y}\bar{x}$$

be the product moment coefficients. 

We must then solve the equation 

$$\begin{vmatrix} \sigma_x^2 - \Sigma^2 & p_{yx} & p_{xz} \\ p_{yx} & \sigma_y^2 - \Sigma^2 & p_{zy} \\ p_{xz} & p_{zy} & \sigma_z^2 - \Sigma^2 \end{vmatrix} = 0 \qquad \text{(vii)},$$

or 

$$\Sigma^6 - \Sigma^4 (\sigma_x^2 + \sigma_y^2 + \sigma_z^2) + \Sigma^2 (\sigma_y^2 \sigma_z^2 + \sigma_x^2 \sigma_z^2 + \sigma_x^2 \sigma_y^2 - p_{zy}^2 - p_{xz}^2 - p_{yx}^2)$$
$$\qquad - (\sigma_x^2 \sigma_y^2 \sigma_z^2 - \sigma_x^2 p_{zy}^2 - \sigma_y^2 p_{xz}^2 - \sigma_z^2 p_{yx}^2 + 2 p_{zy} p_{xz} p_{yx}) = 0 \qquad \text{(viii)},$$

and we shall obtain three values $\Sigma_1^2, \Sigma_2^2, \Sigma_3^2$ of $\Sigma^2$. These values of $\Sigma^2$ are the 
minimum and the two maximum values of the mean square deviations of the 
n points from three planes†. The $\Sigma_1^2$ which gives the minimum value is the one 
we are seeking in the first place; it provides the plane of maximum symmetry, or the 
nearest approach we can get to a true mirror plane or median sagittal plane of the 
skull. The Standard Horizontal Plane and the Standard Transverse Vertical Plane 
must be defined with regard to this plane, and we shall deal with them later on. 

To determine this plane we must solve the equations: 

$$\begin{aligned} (\sigma_x^2 - \Sigma_1^2) L_1 + p_{yx} M_1 + p_{xz} N_1 &= 0, \\ p_{yx} L_1 + (\sigma_y^2 - \Sigma_1^2) M_1 + p_{zy} N_1 &= 0, \\ p_{xz} L_1 + p_{zy} M_1 + (\sigma_z^2 - \Sigma_1^2) N_1 &= 0, \end{aligned} \qquad \text{(ix)}$$

subject to $L_1^2 + M_1^2 + N_1^2 = 1$. 

$L_1, M_1, N_1$ are the direction cosines of the plane of maximum symmetry, and 

$$P_1 = L_1 \bar{x} + M_1 \bar{y} + N_1 \bar{z} \qquad \text{(x)},$$

or the equation of the plane is 

$$L_1 (x - \bar{x}) + M_1 (y - \bar{y}) + N_1 (z - \bar{z}) = 0 \qquad \text{(xi)},$$

and $\Sigma_1^2$ measures the mean square deviation of the mid-sagittal points from this plane. 
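
The whole procedure of theorem (vii) may be sketched in code as follows. This is an illustration under stated assumptions, not the method of computation used in the Laboratory: the function name is ours; the least root of the cubic is reached by Newton's Rule starting from zero; and the direction cosines are taken from the first two of Equations (ix) by cross-multiplication, which presupposes that the fitted plane is not parallel to the axis of z. The six test points are invented and lie exactly in the plane x + 2y + 2z = 9.

```python
import math


def plane_of_closest_fit(points):
    """Return (least mean square deviation, direction cosines, perpendicular
    from the origin) for the plane of closest fit to a list of (x, y, z)."""
    n = len(points)
    mean = [sum(p[i] for p in points) / n for i in range(3)]

    def moment(i, j):
        # Product moment about the mean; equals S(..)/n minus product of means.
        return sum((p[i] - mean[i]) * (p[j] - mean[j]) for p in points) / n

    sxx, syy, szz = moment(0, 0), moment(1, 1), moment(2, 2)
    pxy, pxz, pyz = moment(0, 1), moment(0, 2), moment(1, 2)

    # Cubic (viii) in t = Sigma^2:  t^3 - c2 t^2 + c1 t - c0 = 0.
    c2 = sxx + syy + szz
    c1 = syy * szz + sxx * szz + sxx * syy - pyz ** 2 - pxz ** 2 - pxy ** 2
    c0 = (sxx * syy * szz - sxx * pyz ** 2 - syy * pxz ** 2
          - szz * pxy ** 2 + 2 * pyz * pxz * pxy)

    # Newton's Rule from t = 0 climbs to the least root.
    t = 0.0
    for _ in range(100):
        step = (((t - c2) * t + c1) * t - c0) / ((3 * t - 2 * c2) * t + c1)
        t -= step
        if abs(step) < 1e-14 * max(1.0, abs(t)):
            break

    # First two of Equations (ix), solved by cross-multiplication.
    r1 = (sxx - t, pxy, pxz)
    r2 = (pxy, syy - t, pyz)
    cross = (r1[1] * r2[2] - r1[2] * r2[1],
             r1[2] * r2[0] - r1[0] * r2[2],
             r1[0] * r2[1] - r1[1] * r2[0])
    norm = math.sqrt(sum(c * c for c in cross))
    cosines = tuple(c / norm for c in cross)
    perpendicular = sum(a * b for a, b in zip(cosines, mean))  # Equation (x)
    return t, cosines, perpendicular


# Invented points lying exactly in the plane x + 2y + 2z = 9.
demo = [(9, 0, 0), (0, 4.5, 0), (0, 0, 4.5), (1, 2, 2), (3, 1, 2), (5, 1, 1)]
sigma_sq, cosines, perpendicular = plane_of_closest_fit(demo)
print(sigma_sq, cosines, perpendicular)
```

For points exactly in one plane the least root vanishes and the prepared form of that plane is recovered; for real mid-sagittal points the least root is the mean square residual discussed above.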


* Philosophical Magazine, 1901, pp. 561 — 568. 

† All the terms in brackets are positive, and the equation has three real roots. 



232 The Cranial Coordinatograph 

5. Illustration of Numerical Work . 

We will now give a numerical example of finding a plane of maximum 
symmetry, and then more briefly provide the equations giving $\Sigma_1^2$ and this and 
other planes for several skulls. We will take the Arab skull, section (C) of Table I, 
p. 224, and consider the 13 sets of mid-sagittal points, coordinates given in the 
following table: 

TABLE III. 

Coordinates for Arab Skull. 

Point                            x         y         z      z' = z - 193.5 

 1. Alveolar Point           +  46.6    + 88.8    198.1       + 4.6 
 2. Nasal Spine              +  28.8    + 92.3    198.1       + 4.6 
 3. Nasion                   -  29.4    + 83.7    196.5       + 3.0 
 4. Glabella                 -  39.6    + 84.8    196.0       + 2.5 
 5. Bregma                   - 119.9    +  4.1    190.6       - 2.9 
 6. Apex                     - 119.0       0.0    196.1       + 2.6 
 7. Lambda                   -  49.9    - 78.1    189.7       - 3.8 
 8. Occipital Protuberance   -   6.2    - 75.5    190.6       - 2.9 
 9. Kappa                        0.0    - 71.2    189.8       - 3.7 
10. Inferior Inion           +   6.9    - 65.8    189.8       - 3.7 
11. Opisthion                +  29.8    - 38.4    193.2       - 0.3 
12. Basion                   +  25.5    +  4.1    193.1       - 0.4 
13. Base Palatal Spine       +  30.8    + 45.4    194.4       + 0.9 

Sum                          - 195.6    + 74.2   2516.0       + 0.5 


Sum ÷ 13: 

(i) $\bar{x} = -15.046154$, $\bar{y} = +5.707692$, $\bar{z} = 193.5384615$, $\bar{z}' = +0.0384615$. 

Squares: 

(ii) $\bar{x}^2 = 226.386750$, $\bar{y}^2 = 32.577748$, $\bar{z}'^2 = 0.001479$. 

Products: 

(iii) $\bar{y}\bar{z}' = +0.219526$, $\bar{z}'\bar{x} = -0.578698$, $\bar{x}\bar{y} = -85.878813$. 

Mean Squares of Coordinates from Table IV: 

(iv) $S(x^2)/13 = 3002.5784615$, $S(y^2)/13 = 4259.210769$, $S(z'^2)/13 = 9.540769$. 

Mean Products from Table IV: 

(v) $S(yz')/13 = 181.356923$, $S(z'x)/13 = 29.892308$, $S(xy)/13 = 365.822308$. 

(iv) minus (ii): $\sigma_x^2 = 2776.191711$, $\sigma_y^2 = 4226.633021$, $\sigma_z^2 = 9.539290$. 

(v) minus (iii): $p_{yz} = 181.137397$, $p_{zx} = 30.471006$, $p_{xy} = 451.701121$. 

We are now in a position to write down the fundamental cubic as given by 
Equation (viii). It is 

$$\Sigma^6 - 7012.364022\,\Sigma^4 + 11562972.392766\,\Sigma^2 - 19960121.046469 = 0.$$

The ratio of the coefficients of the last two terms is 1.726, and we test with 
$\Sigma^2 = 1.730$ giving 22839, and 1.728 giving -238.353. The root is close to 1.728. 




Karl Pearson 


233 


Applying Newton's Rule we obtain 1.7280203 and thence finally $\Sigma_1^2 = 1.7280207$. 
This is the mean square deviation of the thirteen mid-sagittal points from the 
Plane of Maximum Symmetry. The other two roots of the cubic will be discussed 
later. 
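
The approach to the least root can be retraced as below. This is a minimal sketch using the printed coefficients of the cubic; the starting value is the ratio of the last two coefficients, as in the text, and the variable names are ours.

```python
# Coefficients of the cubic in t = Sigma^2 for the Arab skull:
#   t^3 - C2 t^2 + C1 t - C0 = 0
C2 = 7012.364022
C1 = 11562972.392766
C0 = 19960121.046469


def f(t):
    """Left-hand side of the cubic."""
    return ((t - C2) * t + C1) * t - C0


def df(t):
    """Derivative of the cubic with respect to t."""
    return (3 * t - 2 * C2) * t + C1


t = C0 / C1          # first approximation, about 1.726
for _ in range(20):  # Newton's Rule
    step = f(t) / df(t)
    t -= step
    if abs(step) < 1e-12:
        break

sigma1_sq = t
print(round(C0 / C1, 3), round(f(1.730)), round(f(1.728), 1), sigma1_sq)
```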


TABLE IV. 

Squares and Products of the Coordinates of the Arab Skull*. 

Point       x²          y²         z'²        yz'          z'x          xy 

(1)       2171.56     7885.44     21.16    + 408.48     + 214.36    + 4138.08 
(2)        829.44     8519.29     21.16    + 424.58     + 132.48    + 2658.24 
(3)        864.36     7005.69      9.00    + 251.10     -  88.20    - 2460.78 
(4)       1568.16     7191.04      6.25    + 212.00     -  99.00    - 3358.08 
(5)      14376.01       16.81      8.41    -  11.89     + 347.71    -  491.59 
(6)      14161.00        0.00      6.76        0.00     - 309.40         0.00 
(7)       2490.01     6099.61     14.44    + 296.78     + 189.62    + 3897.19 
(8)         38.44     5700.25      8.41    + 218.95     +  17.98    +  468.10 
(9)          0.00     5069.44     13.69    + 263.44         0.00         0.00 
(10)        47.61     4329.64     13.69    + 243.46     -  25.53    -  454.02 
(11)       888.04     1474.56      0.09    +  11.52     -   8.94    - 1144.32 
(12)       650.25       16.81      0.16    -   1.64     -  10.20    +  104.55 
(13)       948.64     2061.16      0.81    +  40.86     +  27.72    + 1398.32 

Sum      39033.52    55369.74    124.03    + 2357.64    + 388.60    + 4755.69 

Sum ÷ 13  3002.5784615  4259.210769  9.540769  + 181.356923  + 29.892308  + 365.822308 

The whole of these values (Tables III and IV) have been put down for the use of any 
craniologist who may desire to test the labour of finding a Plane of Maximum Symmetry. 

We have now to determine the direction cosines $L_1, M_1, N_1$ of the Plane of 
Maximum Symmetry from Equations (ix). These give us: 

$$\begin{aligned} 2774.463690\,L_1 + 451.701121\,M_1 + 30.471006\,N_1 &= 0, \\ 451.701121\,L_1 + 4224.905000\,M_1 + 181.137397\,N_1 &= 0, \\ 30.471006\,L_1 + 181.137397\,M_1 + 7.811269\,N_1 &= 0. \end{aligned}$$

From the first two equations we find 

$$\frac{L_1}{1} = \frac{M_1}{10.418268} = \frac{-N_1}{245.492616}$$

for the relative values, and since $L_1^2 + M_1^2 + N_1^2 = 1$ we have for the absolute values 
of the direction cosines 

$$L_1 = 0.00406975, \quad M_1 = 0.04239970, \quad N_1 = -0.99909244,$$

and if these values be substituted in the third equation for $L_1, M_1, N_1$ above, the 

* In actual practice the squares are put directly on the machine from Barlow’s Tables, and the 
product multiplications are a continuous process on the machine. 

Biometrika xxv 


16 




234 


The Cranial Coordinatograph 

left-hand side will be found to be +0.00000087 instead of zero, which checks the 
value for $\Sigma_1^2$ as we have only worked to six decimal places in the coefficients. 
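
The ratios of the direction cosines and the check by the third equation may be reproduced from the printed coefficients. In the sketch below (names ours) the first two equations are solved by cross-multiplication; the sign of the whole set of cosines is arbitrary, and the code happens to return the set with positive third cosine.

```python
import math

# Coefficients of the three equations for L1, M1, N1 as printed above.
row1 = (2774.463690, 451.701121, 30.471006)
row2 = (451.701121, 4224.905000, 181.137397)
row3 = (30.471006, 181.137397, 7.811269)


def cross(u, v):
    """Cross-multiplication of two rows: a vector perpendicular to both."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


c = cross(row1, row2)
ratio_m = c[1] / c[0]    # M1 / L1
ratio_n = c[2] / c[0]    # N1 / L1

norm = math.sqrt(sum(x * x for x in c))
L1, M1, N1 = (x / norm for x in c)

# Left-hand side of the third equation, which should be nearly zero.
residual = row3[0] * L1 + row3[1] * M1 + row3[2] * N1
print(ratio_m, ratio_n, (L1, M1, N1), residual)
```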

Finally substituting in the values for $P_1$ given by (x) we find $P_1 = -193.181851$, 
or the equation to the Plane of Maximum Symmetry is 

$$-0.00406975\,x - 0.04239970\,y + 0.99909244\,z = +193.181851 \qquad \text{(xii)}.$$

Knowing this plane we can determine the angles it makes with : 

(a) the Frankfurt Horizontal Plane, i.e. x = 0. The cosine of this angle 
= -0.00406975, or the angle is 90° 14.0'. Thus the Frankfurt Horizontal Plane is 
not for this skull perpendicular to the Plane of Maximum Symmetry; 

(b) the Transverse Vertical Plane, i.e. y = 0. The cosine of this angle 
= -0.04239970 and the angle is 92° 25.8', or the Transverse Vertical Plane is not 
perpendicular to the Plane of Maximum Symmetry; 

(c) the Plane perpendicular to the auricular axis, i.e. z = 0. The cosine of this 
angle is 0.99909244 and the angle is 2° 26.5'. Thus the auricular axis for this 
skull is not perpendicular to the best plane we can adopt for mid-sagittal 
symmetry. 
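
These three angles follow at once from the direction cosines of Equation (xii). A minimal sketch, with a helper of our own naming that returns degrees and minutes to one decimal:

```python
import math


def degrees_minutes(cosine):
    """Angle whose cosine is given, as (whole degrees, minutes to one decimal)."""
    angle = math.degrees(math.acos(cosine))
    whole = int(angle)
    return whole, round((angle - whole) * 60, 1)


# Direction cosines of the Plane of Maximum Symmetry, Equation (xii).
L1, M1, N1 = -0.00406975, -0.04239970, 0.99909244

frankfurt = degrees_minutes(L1)    # with the plane x = 0
transverse = degrees_minutes(M1)   # with the plane y = 0
auricular = degrees_minutes(N1)    # with the plane z = 0
print(frankfurt, transverse, auricular)
```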

Again for this skull the Left Porion has for coordinates (0, 0, 255.7) and the 
Right Porion (0, 0, 133.9); thus the Interporial Distance equals 121.8, and the 
Mid-porion is (0, 0, 194.8). Thus the Mid-porion is 1.3 mm. above the mean height 
of the thirteen mid-sagittal points, and further the Plane of Maximum Symmetry 
does not pass through the Mid-porion, but meets the auricular axis at the point 
z = 193.36. The perpendicular distances of the Poria from the Plane of Maximum 
Symmetry are Left Porion 62.29 and Right Porion 59.40, which indicate the 
extent to which the two ears are asymmetrically placed. We can now write down the Equation 
for the Median Sagittal Plane as based on Nasion, Bregma and Lambda for this 
skull. By Equation (vi) it is 

$$\begin{vmatrix} x + 119.9 & y - 4.1 & z - 190.6 \\ -29.4 + 119.9 & 83.7 - 4.1 & 196.5 - 190.6 \\ -49.9 + 119.9 & -78.1 - 4.1 & 189.7 - 190.6 \end{vmatrix} = 0,$$

or, 

$$\begin{vmatrix} x + 119.9 & y - 4.1 & z - 190.6 \\ 90.5 & 79.6 & 5.9 \\ 70.0 & -82.2 & -0.9 \end{vmatrix} = 0,$$

or, expanding, 

$$413.34\,x + 494.45\,y - 13011.10\,z + 2527447.881 = 0.$$

Dividing by the square root of the sum of the squares of the coefficients of 
x, y, z we have the equation to the plane in its prepared form, i.e. with the 
coefficients the direction cosines and the constant term the perpendicular from the 
origin, 

$$-0.0317294\,x - 0.0379556\,y + 0.9987756\,z = +194.015354.$$
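
The expansion of the determinant and the reduction to the prepared form may be verified as follows. The sketch assumes only the three coordinates of Table III; the function name is ours, and the sign of the divisor is chosen so that the coefficient of z is positive, as in the text.

```python
import math

# Coordinates of the three defining points for the Arab skull (Table III).
nasion = (-29.4, 83.7, 196.5)
bregma = (-119.9, 4.1, 190.6)
lambda_point = (-49.9, -78.1, 189.7)


def plane_through(p0, p1, p2):
    """Expand the determinant of Equation (vi): return (a, b, c), d such that
    a x + b y + c z + d = 0 is the plane through the three points."""
    u = [q - p for p, q in zip(p0, p1)]
    v = [q - p for p, q in zip(p0, p2)]
    a = u[1] * v[2] - u[2] * v[1]
    b = u[2] * v[0] - u[0] * v[2]
    c = u[0] * v[1] - u[1] * v[0]
    d = -(a * p0[0] + b * p0[1] + c * p0[2])
    return (a, b, c), d


(a, b, c), d = plane_through(bregma, nasion, lambda_point)

# Prepared form l x + m y + n z = p, with n taken positive.
scale = -math.sqrt(a * a + b * b + c * c)
l, m, n = a / scale, b / scale, c / scale
p = -d / scale
print((a, b, c, d), (l, m, n, p))
```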



Karl Pearson 


235 


It is possible now by inserting the coordinates of any of the mid-sagittal points 
to determine its distance from this plane. The following are the distances of these 
points for the Arab skull : 


 (1) Alveolar Point            + 1.0070 
 (2) Nasal Spine               + 0.5750 
 (3) Nasion                      0.0000 
 (4) Glabella                  + 0.2175 
 (5) Bregma                      0.0000 
 (6) Apex                      - 5.6204 
 (7) Lambda                      0.0000 
 (8) Occipital Protuberance    + 0.5864 
 (9) Kappa                     + 1.7453 
(10) Inferior Inion            + 2.1692 
(11) Opisthion                 + 0.5399 
(12) Basion                    + 2.1165 
(13) Base Palatal Spine        + 2.5538 

Sum of Squares of these Distances   52.3686 

Mean Square Distance from Plane 4.0284. 
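
The distances above may be recomputed from Table III and the prepared form of the plane. A minimal sketch (names ours); the sign convention, perpendicular from the origin less the projected coordinate, is inferred from the printed signs.

```python
# Mid-sagittal points of the Arab skull (x, y, z), in the order of Table III.
points = [
    (46.6, 88.8, 198.1), (28.8, 92.3, 198.1), (-29.4, 83.7, 196.5),
    (-39.6, 84.8, 196.0), (-119.9, 4.1, 190.6), (-119.0, 0.0, 196.1),
    (-49.9, -78.1, 189.7), (-6.2, -75.5, 190.6), (0.0, -71.2, 189.8),
    (6.9, -65.8, 189.8), (29.8, -38.4, 193.2), (25.5, 4.1, 193.1),
    (30.8, 45.4, 194.4),
]

# Prepared form of the Nasion-Bregma-Lambda plane: l x + m y + n z = p.
l, m, n, p = -0.0317294, -0.0379556, 0.9987756, 194.015354

# Signed perpendicular distance of each point from the plane.
distances = [p - (l * x + m * y + n * z) for x, y, z in points]
mean_square = sum(d * d for d in distances) / len(points)

for number, d in enumerate(distances, start=1):
    print(number, round(d, 4))
print(round(mean_square, 4))
```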


Thus although the plane actually passes through three of the mid-sagittal points, its 
mean square deviation is more than double that of the Plane of Maximum 
Symmetry. While the mid-sagittal contour drawn in the Nasion-Bregma-Lambda 
Plane may serve many useful comparative purposes it clearly differs widely from 
any suitable mirror plane, and actually for this Arab skull the angle between this 
ν.β.λ. plane and the Plane of Maximum Symmetry is 1° 36.3'. It makes an angle 
of 91° 49.1' instead of a right angle with the Frankfurt Horizontal Plane, an angle 
of 92° 50.5' with the auricular axis and an angle of 92° 10.5' with the Transverse 
Vertical Plane. 

We may give as one further illustration the measure of prognathism as 
found from the angle between the Frankfurt Horizontal Plane and the line joining 
Alveolar Point to Nasion*. These points are (1) and (3), or for the Arab skull 
(+46.6, +88.8, +198.1) and (-29.4, +83.7, +196.5). The equation to the line joining 
them is 

$$\frac{x - 46.6}{-29.4 - 46.6} = \frac{y - 88.8}{83.7 - 88.8} = \frac{z - 198.1}{196.5 - 198.1},$$

or 

$$\frac{x - 46.6}{76.0} = \frac{y - 88.8}{5.1} = \frac{z - 198.1}{1.6},$$

or the direction cosines (l, m, n) of this line are proportional to 

76.0, 5.1, 1.6, 

giving for absolute value, since $l^2 + m^2 + n^2 = 1$, 

+0.997536, +0.066940, +0.021001. 

The sine of the angle this line makes with the Frankfurt Horizontal Plane, i.e. x = 0, 
is 0.997536, or the Profile Angle itself = 85° 58.6'. 

If we ask how far does the Nasion-alveolar line lie outside the Plane of 
Maximum Symmetry we have to find the angle between the plane whose direction 
cosines are 

-0.004070, -0.042400, +0.999092, 

and the line with direction cosines 

+0.997536, +0.066940, +0.021001. 

* This is of course the measure of prognathism in the skull, but the racial order of prognathism in 
the living, owing to the thickness and protrusion of the lips, may be very different. Perhaps this fact is 
not always adequately emphasised. 


16—2 



236 


The Cranial Coordinatograph 

The sine of the angle (see Equation (v)) is +0.0140837, giving an angle of +48.4', 
or the Profile Line is skewed out of the Plane of Maximum Symmetry towards 
the right side by more than three-quarters of a degree. 
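
Both the Profile Angle and the skew of the Profile Line out of the Plane of Maximum Symmetry may be recomputed from the coordinates of the Alveolar Point and Nasion. A minimal sketch, with variable names of our own choosing:

```python
import math

# Alveolar Point and Nasion of the Arab skull (Table III).
alveolar = (46.6, 88.8, 198.1)
nasion = (-29.4, 83.7, 196.5)

# Direction cosines of the line from Nasion to Alveolar Point.
diff = [a - b for a, b in zip(alveolar, nasion)]      # (76.0, 5.1, 1.6)
length = math.sqrt(sum(c * c for c in diff))
l, m, n = (c / length for c in diff)

# Angle with the Frankfurt Horizontal Plane x = 0, by Equation (v).
profile_angle = math.degrees(math.asin(l))

# Direction cosines of the Plane of Maximum Symmetry, to six places.
plane = (-0.004070, -0.042400, 0.999092)
sin_skew = l * plane[0] + m * plane[1] + n * plane[2]
skew_minutes = 60 * math.degrees(math.asin(sin_skew))

whole = int(profile_angle)
print(whole, round((profile_angle - whole) * 60, 1), round(skew_minutes, 1))
```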

We have illustrated sufficiently the general algebraic and numerical processes 
by which, when the cranial coordinatograph has given the coordinates, the properties 
of any skull can be discussed by the aid of the elementary formulae of analytical 
geometry of three dimensions. 

6. Analytical Geometry of six illustrative Crania. 

The following six illustrative skulls were chosen at random. It is not suggested 
that average results obtained from them would not be widely modified when con- 
siderable series of the same races are dealt with; they are solely given here as 
indicative of the type of problems the cranial coordinatograph enables the cranio- 
logist to attack. 

We have the following numerical results: 

(A) Fuegian Skull* (Recent). 

Elevation of R. Porion 253.8, L. Porion 126.2, Mid-porion 190.0, Bi-porionic 
Distance 127.6. 

Cubic to determine Minimum Mean Square Residual $\Sigma_1^2$: 

$$\Sigma^6 - 7971.463075\,\Sigma^4 + 14600879.769528\,\Sigma^2 - 24460978.059113 = 0.$$

Value of $\Sigma_1^2 = 1.676844$. 

Plane of Maximum Symmetry: 

$$-0.00957040\,x + 0.01573352\,y + 0.99983036\,z = +189.02208197.$$

ν.β.λ. Plane†: 

$$-0.0447048\,x + 0.0102709\,y + 0.9989474\,z = +190.6778475.$$

Martin's Plane‡: 

$$+0.1224878\,x + 0.0386513\,y + 0.9929011\,z = +188.9661990.$$

(B) Nubian Skull* (Ancient Egyptian from Kerma). 

Elevation of R. Porion 247.0, L. Porion 132.7, Mid-porion 189.85, Bi-porionic 
Distance 114.3. 

Cubic to determine Minimum Mean Square Residual $\Sigma_1^2$: 

$$\Sigma^6 - 7662.666546\,\Sigma^4 + 13923838.355612\,\Sigma^2 - 18016485.994210 = 0.$$

Value of $\Sigma_1^2 = 1.2948535$. 

Plane of Maximum Symmetry: 

$$+0.0054027\,x - 0.0062292\,y + 0.9999660\,z = +188.6646436.$$

* In the case of these crania the Inferior Inion was omitted. 

† The usual “mid-sagittal” plane, i.e. that through Nasion (ν), Bregma (β) and Lambda (λ). 

‡ The plane through Nasion, Inion, Basion, assumed by Martin to be “ohne Zweifel” the “best” 
mid-sagittal plane. 



Karl Pearson 


237 


ν.β.λ. Plane: 

$$-0.01123688\,x + 0.00210153\,y + 0.9999347\,z = +188.7521300.$$

Martin's Plane: 

$$+0.0139371\,x - 0.0067188\,y + 0.9998803\,z = 188.2315380.$$

(C) Arab Skull (Modern from Palestine). 

Elevation of R. Porion 133.9, L. Porion 255.7, Mid-porion 194.8, Bi-porionic 
Distance 121.8. 

Cubic to determine Minimum Mean Square Residual $\Sigma_1^2$: 

$$\Sigma^6 - 7012.364022\,\Sigma^4 + 11562972.392766\,\Sigma^2 - 19960121.046469 = 0.$$

Value of $\Sigma_1^2 = 1.72802066$. 

Plane of Maximum Symmetry: 

$$-0.00406975\,x - 0.04239970\,y + 0.99909244\,z = +193.181851.$$

ν.β.λ. Plane: 

$$-0.0317294\,x - 0.0379556\,y + 0.9987756\,z = +194.015354.$$

Martin's Plane: 

$$-0.0103855\,x + 0.0355214\,y - 0.9993150\,z = -193.086908.$$

(D) Teita Negro Skull. 

Elevation of R. Porion 130.6, L. Porion 241.8, Mid-porion 186.2, Bi-porionic 
Distance 111.2. 

Cubic to determine Minimum Mean Square Residual $\Sigma_1^2$: 

$$\Sigma^6 - 7350.914793\,\Sigma^4 + 11965899.9308\,\Sigma^2 - 13692532.1372 = 0.$$

Value of $\Sigma_1^2 = 1.14510147$. 

Plane of Maximum Symmetry: 

$$-0.02201936\,x - 0.00830922\,y + 0.99972301\,z = +186.384314.$$

ν.β.λ. Plane: 

$$-0.05088478\,x - 0.00646706\,y + 0.99868359\,z = +188.1674885.$$

Martin's Plane: 

$$+0.0761587\,x + 0.0144832\,y + 0.9969905\,z = +186.761565.$$

(E) 17th Century English Skull (St Bride's Graveyard). 

Elevation of R. Porion 258.7, L. Porion 133.3, Mid-porion 196.0, Bi-porionic 
Distance 125.4. 

Cubic to determine Minimum Mean Square Residual $\Sigma_1^2$: 

$$\Sigma^6 - 7887.700659\,\Sigma^4 + 13833943.158623\,\Sigma^2 - 40333720.552071 = 0.$$

Value of $\Sigma_1^2 = 2.92042324$. 

Plane of Maximum Symmetry: 

$$+0.00812136\,x - 0.01096533\,y + 0.99990690\,z = +196.7397800.$$



238 


The Cranial Coordinatograph 


ν.β.λ. Plane: 

$$+0.05288718\,x - 0.04615479\,y + 0.99758830\,z = +191.526634.$$

Martin's Plane: 

$$+0.01785298\,x - 0.00112259\,y + 0.99983999\,z = +197.122354.$$

(F) Hindu Skull (Modern, Bengal). 

Elevation of R. Porion 130.4, L. Porion 241.6, Mid-porion 186.0, Bi-porionic 
Distance 111.2. 

Cubic to determine Minimum Mean Square Residual $\Sigma_1^2$: 

$$\Sigma^6 - 7399.968994\,\Sigma^4 + 11628428.712714\,\Sigma^2 - 26070720.672768 = 0.$$

Value of $\Sigma_1^2 = 2.24518824$. 

Plane of Maximum Symmetry: 

$$-0.00174154\,x - 0.00541230\,y + 0.99998384\,z = +186.605594.$$

ν.β.λ. Plane: 

$$+0.06674424\,x - 0.01187277\,y + 0.99769948\,z = +180.915098.$$

Martin's Plane: 

$$-0.11143673\,x - 0.01437877\,y + 0.99366750\,z = +184.759760.$$

On the basis of these and similar results we will now proceed to some com- 
parisons. 

7. Angular Relations . 

We have seen that it has been customary to treat the Frankfurt Horizontal 
Plane, the Transverse Vertical Plane through the auricular axis and a certain plane 
termed the Median Sagittal Plane as standard planes of the skull. Such standard 
planes should be mutually rectangular, but while the first two are at right angles, 
they are rarely perpendicular to the third. The third plane as defined by Martin, 
to judge from the present illustrative crania, seems very inferior to the ν.β.λ. plane 
(see our p. 228). We shall here then confine our attention to the latter plane. 
Table V provides the angles between our First Standard Plane of the skull, the 
Plane of Maximum Symmetry, as representing the mirror plane of the skull, and 
the planes which have been usually hitherto treated as standard planes, but which 
we treat merely as planes of reference. 

Section (α) of the Table shows that the ν.β.λ. Mid-sagittal Plane in none of 
the six skulls approaches closely to the Plane of Maximum Symmetry, the average 
angle between the two planes being more than two degrees. In the English skull 
there is more than three degrees and in the Hindu nearly four degrees. Thus the 
ν.β.λ. plane cannot be looked upon as a close fit to the mid-sagittal points. 

Section (β) shows us that the Frankfurt Horizontal Plane is not perpendicular 
to the Plane of Maximum Symmetry, which I personally think should be a 
prerequisite of a standard horizontal plane. The average deviation is over 30'. 





TABLE V. [Angles between the standard and reference planes for the six skulls, Sections (α) to (η); the tabulated values are not recoverable in this copy.] 

240 


The Cranial Coordinatograph 

Section (γ) indicates that the usual (ν.β.λ.) Mid-sagittal Plane is even worse in 
the degree of perpendicularity to the Frankfurt Horizontal Plane, the average 
deviation amounting to 2° 27.4', or more than four times that of the Plane of 
Maximum Symmetry. 

Section (δ) shows us that the auricular axis (Bi-porionic line) is not 
perpendicular to the Plane of Maximum Symmetry, the average deviation for the six 
crania being slightly over 1°. From the standpoint of the present writer this is 
one measure of the asymmetrical location of the ears. Relative to a good 
approximation to a mirror plane the two ears are shifted slightly forward or backward, 
upward or downward. 

Section (ε) gives the like angle between the auricular axis and the perpendicular 
to the ν.β.λ. Plane. The average angle for this skull is 2° 49.8' or more than 2½ 
times as great as in the case of the Plane of Maximum Symmetry. 

Section (ζ) gives the angle between the Transverse Vertical Plane and that 
of Maximum Symmetry; the average deviation is half a degree, or the customary 
transverse vertical plane is not perpendicular to the plane of closest fit to the 
mid-sagittal points, as it should be in the opinion of the present writer. 

Section (η) indicates, however, that the Plane of Maximum Symmetry is more 
than twice as fit as the ν.β.λ. Plane to represent the mid-sagittal section. If we 
start from the Frankfurt Plane and the Transverse Vertical Plane passing through 
the auricular axis as standard rectangular planes of the skull, then the Plane of 
Maximum Symmetry is more nearly perpendicular to both of these than the usual 
mid-sagittal section. 

8. Mean Square Deviation of the Mid-sagittal Points from various Planes . 

We can now look at another aspect of the relationship of the mid-sagittal points 
to the various planes which may be suggested for the mid-sagittal section. We 
may determine the Mean Square Distance, or the so-called Mean Square Residual, 
of the mid-sagittal points from the various planes. These values are given in 
Table VI. The five planes which may be considered as possible mid-sagittal planes 
are: (a) Our standard plane, the Plane of Maximum Symmetry: this must, of 
course, have the minimum Mean Square Distance. (b) The plane perpendicular to 
the auricular axis at the mean elevation, $\bar{z}$, of the mid-sagittal points. We may 
term this the “$\bar{z}$ Plane”; its Mean Square Distance is the $\sigma_z^2$ of our p. 231. (c) The 
plane perpendicular to the auricular axis and bisecting it. This may be termed the 
mid-porion sagittal plane. (d) The mid-sagittal plane through Nasion, Bregma and 
Lambda (the ν.β.λ. Plane). (e) Martin's Plane through Nasion, Inion and Basion. 

We have already (see p. 228) discussed the last two planes, (d) and (e) of the 
Table. As far as the present skulls are concerned neither is comparable with (a), 
(b) and (c). If any craniologist finds the solving of a cubic equation too severe 
a mathematical labour, the coordinatograph will rapidly give him $\bar{z}$ and the Mean 
Elevation Mid-sagittal or $\bar{z}$-Plane. The Plane of Maximum Symmetry is twice as good 



Karl Pearson 241 

as this plane, but the latter is very much better than either of the planes fixed by 
three mid-sagittal points only. 

Clearly the mean square residual for any skull is a rough test of how far the 
mid-sagittal points lie in one plane, i.e. how far there is a true mid-sagittal plane, 
on the basis of which we could test the mirror symmetry of other points of the skull.
A little thought, however, shows that this is only a rough approximation; the
skulls are of different absolute sizes, and if we increased the linear dimensions of
a skull by 5% we should increase the mean square residual by more than 10%.
We need accordingly an index from which we have eliminated the absolute size 
of the skull. We cannot therefore assert that the relative mid-sagittal asymmetry 
of the above six skulls is measured by the numbers in the (a) column of Table VI. 
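
The effect of scale noted above can be made explicit in one line. In this brief worked step the symbols $d_i$ (perpendicular distances of the $N$ mid-sagittal points from the plane) and $k$ (the linear magnification) are introduced for illustration only and are not the author's notation:

```latex
\[
\frac{1}{N}\sum_{i=1}^{N}(k\,d_i)^2 \;=\; k^2\cdot\frac{1}{N}\sum_{i=1}^{N} d_i^2,
\qquad k = 1.05 \;\Longrightarrow\; k^2 = 1.1025 .
\]
```

A 5% linear enlargement therefore raises the mean square residual by 10.25%, which is why a divisor proportional to the squared linear dimensions is wanted.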

TABLE VI.

Mean Square Distance of the Mid-sagittal Points from five Planes which may be treated as Mid-sagittal.

(a) Plane of Maximum Symmetry as Standard Mid-sagittal Plane.
(b) Plane perpendicular to Auricular Axis at Mean Elevation of Mid-sagittal Points.
(c) Plane perpendicular to Auricular Axis through Mid-porion.
(d) Usual Mid-sagittal Plane, or Plane through Nasion, Bregma and Lambda.
(e) Martin’s Mid-sagittal Plane, or Plane through Nasion, Inion and Basion.

Skull                          (a)       (b)       (c)       (d)       (e)

Negro (Teita)                 1.1451    2.7498    2.7593    5.7284   32.0419
Ancient Egyptian (Nubia)      1.2949    1.5245    2.4443    1.5167    1.5909
Fuegian                       1.6768    2.8727    4.8327    8.5685   62.4137
Arab (Modern)                 1.7280    9.6393   11.1308    4.0284    2.5998
Hindu (Bengal)                2.2452    2.4083    2.8062   31.1950   36.5764
English (17th century)        2.9204    3.6529    4.6923   27.3563    4.0257

Mean for six skulls           1.8351    3.7912    4.7609   13.0055   23.0414


To allow for the absolute size of the skull I have taken three lengths of the skull 
in directions at right angles, choosing these lengths from the plan and elevation 
drawings rather than from calliper measurements of the skull. I have taken : 

(a) The Bi-porionic Distance ; this is a length on the axis of z. 

(b) The perpendicular from the Apex on the auricular axis. This is the x 
coordinate of the Apex. 

(c) The length of the projection of the line joining Nasion to Kappa onto the
axis of y, i.e. onto the line joining the plan of Kappa to the plan of the Poria.

I have then squared the cube root of the product of (a), (b) and (c), and thus
obtained a quantity depending on the squared linear dimensions of the skull, by
which it seemed that the mean square residuals might be divided so as to obtain
a reasonable index independent of the dimensions of the skull. The resulting
number has been multiplied by 100. In the case of our six skulls the following
Table gives the resulting indices.


TABLE VII.

Indices of Medial Asymmetry.

(α) Minimum Mean Square Residual.
(β) Product of the Cranial Rectangular Lengths, (a) x (b) x (c).
(γ) [(a) x (b) x (c)]^(2/3).
(δ) Index = (α)/(γ).

Skull                         (α)         (β)            (γ)        (δ)

Negro (Teita)                1.1451   2,075,509.080    16271.1     70.38
Ancient Egyptian (Nubia)     1.2949   2,513,682.171    18487.3     70.04
Fuegian                      1.6768   2,664,900.480    19220.5     87.24
Arab (Modern)                1.7280   2,245,151.580    17146.0    100.78
Hindu (Bengal)               2.2452   2,050,821.568    16141.8    139.09
English (17th century)       2.9204   2,694,690.504    19364.5    149.88


Only one change has been made in the order of the crania by reducing the 
minimum mean square residuals to an index independent of absolute size. It 
would be foolish to draw any conclusion as to the relative position of the races to 
which these six skulls belong from the order of either column (α) or (δ) of this
Table, but the Table may suggest interesting problems, which might be followed
up, if considerable racial groups were worked out*. I have contented myself here by
showing the manner in which numerical results may be obtained.
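
The reduction to a size-free index can be sketched in a few lines. This is a minimal illustration, not the author's own computing scheme: the function name and the `SCALE` constant are hypothetical, and the factor of one million is an assumption inferred from the printed indices of Table VII (it is the scaling under which the tabulated figures for the first skulls are reproduced).

```python
# Size-free asymmetry index: the minimum mean square residual divided by the
# squared cube root of the product of three rectangular cranial lengths.
SCALE = 1_000_000  # assumed scale, inferred from the printed indices

def asymmetry_index(min_msr, length_product, scale=SCALE):
    """Mean square residual over (cube root of the length product) squared."""
    size_squared = length_product ** (2.0 / 3.0)  # in sq. mm
    return scale * min_msr / size_squared

# (minimum mean square residual, product of the three lengths) from Table VII
skulls = {
    "Negro (Teita)": (1.1451, 2075509.080),
    "Ancient Egyptian (Nubia)": (1.2949, 2513682.171),
    "Fuegian": (1.6768, 2664900.480),
}

indices = {name: asymmetry_index(r, p) for name, (r, p) in skulls.items()}
for name, value in indices.items():
    print(f"{name:26s} {value:7.2f}")
```

Dividing by a squared length is what makes the index dimensionless; the cube root of the product of three perpendicular lengths serves as a single representative linear dimension of the skull.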

9. The remaining Standard Planes.

If we accept the view that the Plane of Maximum Symmetry is the most 
suitable plane to take as a mid-sagittal plane, and we term it our First Standard 
Plane, the question immediately follows : How are we to select our remaining 
standard planes ? These will correspond respectively to the Horizontal Plane and 
Transverse Vertical Plane of the skull, but they must be chosen so as to be at 
right angles to one another and to the Plane of Maximum Symmetry. 

Now the process described in my paper in the Philosophical Magazine leads
to three mutually rectangular planes defined by $\Sigma_1^2$ and by $\Sigma_2^2$ and $\Sigma_3^2$, the other
two roots of the cubic. $\Sigma_2^2$ and $\Sigma_3^2$ are easily found from a quadratic equation and
are maxima values of the Mean Square Deviations. It might at first be supposed
that these are the very planes we need. But we must remember that the first 
standard plane has been obtained on the basis of its approximation to the mirror 
plane, and only the mid-sagittal points have been used in its determination. It is 
accordingly somewhat one-sided to use solely these points in determining planes, 
which we need only limit as planes necessarily perpendicular to the Plane of 
Maximum Symmetry. Unfortunately when we come to the Horizontal Plane, and
the Transverse Vertical Plane, we have no long series of points like the mid-sagittal
points from which to determine approximate planes. The Apex and the
Kappa are not natural points of the skull, but artificial points resulting from the
definition of the Frankfurt Horizontal Plane. The Poria may be considered more
nearly natural points although the Frankfurt Plane enters into their determination.
The Orbitalia may be considered again as only in part natural points, as they depend
upon the Poria, and the lowest points on the orbital rims are really meaningless,
without the conception of the Horizontal Plane. But there is a greater difficulty
arising here. When the skull is adjusted on the craniophor to the Frankfurt
Horizontal Plane it will be found that in some cases several millimetres of the
lower borders of the orbits may practically be parallel to that plane. This may not
modify seriously the plans of the Orbitalia, but it renders their elevations above
the drawing-board, when the auricular axis of the skull is made perpendicular to
the board, occasionally difficult of accurate determination.

* For example: Does asymmetry increase as we pass from more primitive to more highly civilised
groups?
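
The cubic referred to above is, in modern terms, the characteristic equation of the matrix of second moments of the points. The following sketch restates the least-squares problem in matrix notation; the symbols $S$, $\mathbf{n}$, $p$, $\mathbf{r}_i$ and $\bar{\mathbf{r}}$ are introduced here for illustration and are not the notation of the Philosophical Magazine paper.

```latex
\[
\min_{\mathbf{n},\,p}\ \frac{1}{N}\sum_{i=1}^{N}\bigl(\mathbf{n}\cdot\mathbf{r}_i - p\bigr)^2
\quad\text{subject to}\quad \mathbf{n}\cdot\mathbf{n} = 1
\;\Longrightarrow\;
p = \mathbf{n}\cdot\bar{\mathbf{r}},\qquad
S\,\mathbf{n} = \Sigma^2\,\mathbf{n},\qquad
\det\bigl(S - \Sigma^2 I\bigr) = 0,
\]
\[
S = \frac{1}{N}\sum_{i=1}^{N}(\mathbf{r}_i - \bar{\mathbf{r}})(\mathbf{r}_i - \bar{\mathbf{r}})^{\mathsf T}.
\]
```

The determinant condition is a cubic in $\Sigma^2$. Its least root is the minimum mean square residual and its direction $\mathbf{n}$ gives the plane of closest fit; the three roots together correspond to three mutually perpendicular planes through the centroid $\bar{\mathbf{r}}$ of the points.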

Thus although it is perfectly easy to develop the mathematical solution for 
determining a plane perpendicular to a given plane, and fitting closely by a 
minimum square residual to a selected system of points — assumed in the case of 
a skull truly symmetrical to lie in that plane — yet in practice it is not easy to 
select such a system of points in the case either of the Horizontal Plane, or of the 
Transverse Vertical Plane. 

I will not give here the mathematical theory by which a plane is constructed
perpendicular to a given plane and of closest fit to a selected series of points, but
merely indicate what resulted in the case of the Fuegian skull. The points selected
were:

the two Poria (0, 0, 253.8) and (0, 0, 126.2),
and the two Orbitalia (13.3, 82.4, 219.0) and (5.1, 82.4, 145.8).

The equation to the plane perpendicular to the Plane of Maximum Symmetry
of this skull and of closest fit to the above four points is*

0.9679125 x - 0.2508948 y + 0.0140420 z + 13.946272 = 0.

This plane makes an angle of 14° 33.2' with the plane of x = 0, and is accordingly
not very close to the Frankfurt Horizontal Plane. Clearly the plane perpendicular
to our First Standard Plane and fitting closely to the Poria and Orbitalia is not, for
this skull at any rate, at all approximated to by the Frankfurt Horizontal Plane.

The above plane is the plane of Least Square Residual from the Poria and
Orbitalia which is perpendicular to the First Standard Plane. The plane of
maximum mean square deviation subject also to the perpendicularity condition is

0.2510725 x + 0.9678841 y - 0.0127715 z - 15.889862 = 0.

This corresponds to the Transverse Vertical Plane, but it makes an angle of
14° 33.6' with it.

* The plane determined in this way passes through the centroid of the Poria and Orbitalia. 
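
The angles quoted above follow directly from the direction cosines: when a plane's equation is written with unit normal, its angle with the plane x = 0 is the angle whose cosine is the coefficient of x, and likewise for y = 0. A minimal sketch (the function name is hypothetical, chosen for illustration):

```python
import math

def plane_angle_deg_min(cosine):
    """Angle whose cosine is `cosine`, returned as (whole degrees, minutes)."""
    theta = math.degrees(math.acos(abs(cosine)))
    degrees = int(theta)
    minutes = (theta - degrees) * 60.0
    return degrees, minutes

# Coefficient of x in the least-square plane, and of y in the plane of
# maximum deviation, for the Fuegian skull.
for label, c in (("with x = 0", 0.9679125), ("with y = 0", 0.9678841)):
    d, m = plane_angle_deg_min(c)
    print(f"angle {label}: {d} deg {m:.1f} min")
```

To the tenth of a minute this reproduces the two angles of about fourteen and a half degrees given in the text.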




These results are not satisfactory, if we wish our second and third standard 
planes to approximate fairly closely to the Frankfurt Horizontal Plane and the 
customary Transverse Vertical Plane. Accordingly, it seemed worth while investi- 
gating whether the other two roots of the fundamental cubic would give a system 
of rectangular planes with adequate approximation to the Frankfurt Horizontal 
Plane and to the customary Transverse Vertical Plane. 

In the case of the Fuegian skull the three roots of the cubic are

$\Sigma_1^2 = 1.6768435$,

$\Sigma_2^2 = 5121.4928812$,

$\Sigma_3^2 = 2848.2933502$.

$\Sigma_2^2$, with the greater maximal value of the Mean Square Residual, led to the plane

0.43121286 x + 0.90219390 y - 0.01008176 z = 12.89303076.

This plane is closest to the Transverse Vertical Plane, or the reference plane y = 0,
but it makes an angle of 25° 33.1' with it.

The other maximum, $\Sigma_3^2$, provides the plane

-0.90219942 x + 0.43104409 y - 0.01540415 z = 22.68790655,

which corresponds to the Frankfurt Horizontal Plane, but makes an angle with it
of 25° 33.1', equal to that which the previous plane makes with the Transverse
Vertical Plane. This did not seem very hopeful, but the like planes were worked
out for the Negro skull from the Teita Hills. The three roots of the fundamental
cubic (given on p. 237) are

$\Sigma_1^2 = 1.14510147$, $\Sigma_2^2 = 4918.78794908$, $\Sigma_3^2 = 2430.98174235$.

$\Sigma_2^2$ gives us the equation

0.08721082 x + 0.99613765 y + 0.01020028 z = 9.5487522.

This plane is closest to the Transverse Vertical Plane, or y = 0; it makes an angle
with it of 5° 3.2'.

$\Sigma_3^2$ gives us

-0.99594649 x + 0.08741127 y - 0.02120966 z = 9.3975540,

which makes an angle of -5° 9.6' with the plane of x = 0, or the Frankfurt Horizontal
Plane. These results for the Negro skull are better than those for the
Fuegian, but are not satisfactory, if we desire our Second and Third Standard Planes
to approach fairly closely to the Frankfurt Horizontal Plane and the customary
Transverse Vertical Plane.

No doubt the three roots of the fundamental cubic would provide three standard 
mutually rectangular planes possessing certain very definite physical relations with 
regard to any skull, but they would fail to provide close approximations to the 
Frankfurt Horizontal Plane. Accordingly, I started on a different route to find my 
Second Standard Plane. I sought a plane which should be at right angles to the First 
Standard Plane, i.e. the Plane of Maximum Symmetry, and should have a maximum 
cosine, i.e. a minimum angle with the Frankfurt Horizontal Plane. This should
form the Second Standard Plane. The Third Standard Plane must be perpendicular
to the First and Second Standard Planes, and this will fully determine its direction
cosines.

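Mathematically: let $\lambda, \mu, \nu$ be the direction cosines of the Second Standard
Plane; $l, m, n$ those of the First Standard Plane, and $\lambda', \mu', \nu'$ of the Third Standard
Plane. Let $L, M, N$ be the direction cosines of the plane with which $\lambda, \mu, \nu$ is to
make a minimum angle $\theta$ or a maximum cosine $u = \cos\theta$. Then

\[ u = L\lambda + M\mu + N\nu, \]
\[ 0 = l\lambda + m\mu + n\nu, \]
\[ \lambda^2 + \mu^2 + \nu^2 = 1. \]

Hence for a maximum of $u$, if $A$ and $B$ be indeterminate multipliers,

\[ l + AL + B\lambda = 0, \qquad m + AM + B\mu = 0, \qquad n + AN + B\nu = 0. \]

Whence

\[ 1 + A(Ll + Mm + Nn) = 0, \qquad Au_0 + B = 0. \]

Accordingly,

\[ L(1 - l^2) - Mlm - Nnl = \lambda u_0, \]
\[ -Lml + M(1 - m^2) - Nmn = \mu u_0, \]
\[ -Lnl - Mnm + N(1 - n^2) = \nu u_0. \]

Whence

\[ \frac{\lambda}{L(1 - l^2) - Mlm - Nnl} = \frac{\mu}{-Lml + M(1 - m^2) - Nmn} = \frac{\nu}{-Lnl - Mnm + N(1 - n^2)} = \frac{1}{u_0} \qquad \text{(xiii)}, \]

which solve the general problem.

But for our special case: $L = 1$, $M = 0$, $N = 0$, and thus

\[ \frac{\lambda}{1 - l^2} = \frac{\mu}{-ml} = \frac{\nu}{-nl} = \frac{1}{u_0} = \frac{1}{\sqrt{1 - l^2}}. \]

Thus

\[ \lambda = \sqrt{1 - l^2}, \qquad \mu = -\frac{lm}{\sqrt{1 - l^2}}, \qquad \nu = -\frac{ln}{\sqrt{1 - l^2}} \qquad \text{(xiv)}; \]

these determine the direction cosines of the Second Standard Plane.
For the Third Standard Plane we have

\[ l\lambda' + m\mu' + n\nu' = 0 \]

and

\[ \lambda\lambda' + \mu\mu' + \nu\nu' = 0, \]

or

\[ \frac{\lambda'}{m\nu - n\mu} = \frac{\mu'}{n\lambda - l\nu} = \frac{\nu'}{l\mu - m\lambda}. \]

Or again substituting for $\lambda, \mu, \nu$,

\[ \frac{\lambda'}{0} = \frac{\mu'}{n/\sqrt{1 - l^2}} = \frac{\nu'}{-m/\sqrt{1 - l^2}}. \]

Thus $\lambda'$ must $= 0$, and

\[ \frac{\mu'}{n} = \frac{\nu'}{-m} = \frac{1}{\sqrt{1 - l^2}}. \]

Accordingly, we have finally:

\[ \lambda' = 0, \qquad \mu' = \frac{n}{\sqrt{1 - l^2}}, \qquad \nu' = -\frac{m}{\sqrt{1 - l^2}} \qquad \text{(xv)}. \]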

We have thus reached on our hypothesis fully determined directions of the three 
standard planes. 
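
The closed forms reached above are easily checked numerically. The sketch below is illustrative only (the function and variable names are hypothetical); it takes the direction cosines l, m, n of the First Standard Plane and returns those of the Second and Third, using as input the values for the 17th Century English skull.

```python
import math

def standard_plane_cosines(l, m, n):
    """Direction cosines of the Second and Third Standard Planes.

    The Second plane is perpendicular to (l, m, n) and makes the least
    angle with the plane x = 0; the Third is perpendicular to both.
    """
    s = math.sqrt(1.0 - l * l)
    second = (s, -l * m / s, -l * n / s)
    third = (0.0, n / s, -m / s)
    return second, third

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

first = (0.00812136, -0.01096533, 0.99990690)  # English skull
second, third = standard_plane_cosines(*first)

print("second:", ["%.8f" % c for c in second])
print("third: ", ["%.8f" % c for c in third])
# The three normals should be mutually perpendicular (up to rounding of l, m, n).
print(dot(first, second), dot(first, third), dot(second, third))
```

Because the Frankfurt Horizontal Plane is the plane x = 0, only l enters the tilt of the Second Standard Plane, whose angle with the Frankfurt plane is simply the angle whose sine is the magnitude of l.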

The reader will observe that our Third Standard Plane is invariably at right
angles to the plane of x = 0, since λ' = 0, that is to the Frankfurt Horizontal Plane.
We may now test how satisfactorily this arrangement works on our six illustrative
crania.


(A) Fuegian Skull.

l = -0.00957040, m = +0.01573352, n = +0.99983036.

Direction Cosines of Second Standard Plane:

λ = 0.99995420, μ = +0.00015060, ν = +0.00956921.

The Second Standard Plane makes an angle of 0° 32.9' with the Frankfurt
Horizontal Plane.

Direction Cosines of Third Standard Plane:

λ' = 0, μ' = +0.99987615, ν' = -0.0153424.

The Third Standard Plane makes an angle of +0° 54.1' with the customary
Transverse Vertical Plane.

(B) Ancient Egyptian Skull from Nubia.

l = +0.0054027, m = -0.0062292, n = +0.9999660.

Direction Cosines of Second Standard Plane:

λ = 0.99998540, μ = +0.00003365, ν = -0.00540260.

The Second Standard Plane makes an angle of 0° 11.6' with the Frankfurt
Horizontal Plane.

Direction Cosines of Third Standard Plane:

λ' = 0, μ' = +0.99998060, ν' = +0.00622929.

The Third Standard Plane makes an angle of +0° 21.4' with the customary
Transverse Vertical Plane.

(C) Arab Skull.

l = -0.00406975, m = -0.04239970, n = +0.99909244.

Direction Cosines of Second Standard Plane:

λ = 0.99999127, μ = -0.00017256, ν = +0.00406609.

The Second Standard Plane makes an angle of 0° 14.3' with the Frankfurt
Horizontal Plane.

Direction Cosines of Third Standard Plane:

λ' = 0, μ' = +0.99910116, ν' = +0.04240007.

The Third Standard Plane makes an angle of 2° 25.8' with the customary
Transverse Vertical Plane.

(D) Teita Negro Skull.

l = -0.02201986, m = -0.00830922, n = +0.99972301.

Direction Cosines of Second Standard Plane:

λ = 0.99975754, μ = -0.00018301, ν = -0.02201860.

The Second Standard Plane makes an angle of 1° 15.7' with the Frankfurt
Horizontal Plane.

Direction Cosines of Third Standard Plane:

λ' = 0, μ' = +0.99996546, ν' = +0.00831123.

The Third Standard Plane makes an angle of +0° 28.5' with the customary
Transverse Vertical Plane.

(E) 17th Century English Skull.

l = +0.00812136, m = -0.01096533, n = +0.99990690.

Direction Cosines of Second Standard Plane:

λ = 0.99996702, μ = +0.00008906, ν = -0.00812087.

The Second Standard Plane makes an angle of 0° 27.9' with the Frankfurt
Horizontal Plane.

Direction Cosines of Third Standard Plane:

λ' = 0, μ' = +0.99993988, ν' = +0.01096569.

The Third Standard Plane makes an angle of 0° 37.7' with the customary
Transverse Vertical Plane.

(F) Hindu Skull.

l = -0.00174154, m = -0.00541230, n = +0.99998384.

Direction Cosines of Second Standard Plane:

λ = 0.99999848, μ = -0.00000943, ν = +0.00174151.

The Second Standard Plane makes an angle of 0° 6.0' with the Frankfurt
Horizontal Plane.

Direction Cosines of Third Standard Plane:

λ' = 0, μ' = +0.99998536, ν' = +0.00541231.

The Third Standard Plane makes an angle of 0° 18.6' with the customary
Transverse Vertical Plane.

As a result we see that our Second Standard Plane as defined above makes a
mean angle (28.1') for the six skulls with the Frankfurt Horizontal Plane of less
than half a degree, while the Third Standard Plane makes a mean angle (51.0'), less
than a degree, with the customary Transverse Vertical Plane. Thus with the above
definitions of the Second and Third Standard Planes we reach planes which make
respectively only small angles with such very familiar planes as the Frankfurt
Horizontal Plane and the Transverse Vertical Plane.

One point alone remains unsettled, namely: We have determined by (xiv) and 
(xv) the direction cosines of these Standard Planes as functions solely of the 
direction cosines of the Plane of Maximum Symmetry. But we have not determined 
the meet of our three Standard Planes. This remains to be selected. After dealing 
with several points I came to the conclusion that the most suitable point to choose 
as origin of the Standard Planes was the point in which the First Standard Plane 
or Plane of Maximum Symmetry meets the auricular axis. The elevations of these
points are respectively Fuegian, 189.054163; Egyptian from Nubia, 188.671058;
Arab, 193.357334; Teita Negro, 186.435955; English, 196.758098; Hindu,
186.608610.

Paying attention to which Porion was uppermost (as noted on pp. 236-238) we
find that in no case is the intersection more than 1.5 mm. from the Mid-porion, the
maximum being 1.4427 in the case of the Arab skull. Four intersections deviate
from the Mid-porion towards the Left Porion and two towards the Right Porion,
the average of the deviations of the six intersections is only 0.0136 mm. towards the
Right Porion. The point chosen seems therefore a reasonable one, and enables us 
to write down very readily the equations to the Standard Planes. For our six 
skulls they are as follows : 

(A) Fuegian Skull.

1st Standard Plane:

-0.0095704 x + 0.0157335 y + 0.9998304 z = 189.022082,

2nd Standard Plane:

0.9999542 x + 0.0001506 y + 0.0095692 z = 1.8090990,

3rd Standard Plane:

-0.9998762 y + 0.0153424 z = 2.9005446.

(B) Egyptian Skull (from Nubia).

1st Standard Plane:

0.0054027 x - 0.0062292 y + 0.9999660 z = 188.664644,

2nd Standard Plane:

-0.9999854 x - 0.0000337 y + 0.0054026 z = 1.0193143,

3rd Standard Plane:

0.9999806 y + 0.0062293 z = 1.1752867.

(C) Arab Skull.

1st Standard Plane:

-0.0040698 x - 0.0423997 y + 0.9990924 z = 193.181851,

2nd Standard Plane:

0.9999913 x - 0.0001726 y + 0.0040661 z = 0.7862103,

3rd Standard Plane:

0.9991012 y + 0.0424001 z = 8.1983145.

(D) Teita Negro Skull.

1st Standard Plane:

-0.0220194 x - 0.0083092 y + 0.9997230 z = 186.384814,

2nd Standard Plane:

-0.9997575 x + 0.0001830 y + 0.0220186 z = 4.1050587,

3rd Standard Plane:

0.9999655 y + 0.0083112 z = 1.5495065.

(E) English Skull.

1st Standard Plane:

0.0081214 x - 0.0109653 y + 0.9999069 z = 196.739780,

2nd Standard Plane:

-0.9999670 x - 0.0000891 y + 0.0081209 z = 1.5978469,

3rd Standard Plane:

0.9999399 y + 0.0109657 z = 2.1575883.

(F) Hindu Skull.

1st Standard Plane:

-0.0017415 x - 0.0054123 y + 0.9999838 z = 186.605594,

2nd Standard Plane:

0.9999985 x - 0.0000094 y + 0.0017415 z = 0.3249808,

3rd Standard Plane:

0.9999854 y + 0.0054123 z = 1.0099836.
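
Since the common origin is taken on the auricular axis, which is the axis of z, each equation above is fixed by a set of direction cosines and a single elevation. As a hedged sketch (the helper name is hypothetical), the constant term of each plane is the z direction cosine multiplied by the elevation of the chosen point; the Hindu skull is used as the example.

```python
def plane_through_axis_point(cosines, z0):
    """Return (a, b, c, p) for the plane a*x + b*y + c*z = p through (0, 0, z0)."""
    a, b, c = cosines
    return a, b, c, c * z0

z0 = 186.608610  # elevation where the First Standard Plane meets the auricular axis (Hindu)

first = plane_through_axis_point((-0.00174154, -0.00541230, 0.99998384), z0)
second = plane_through_axis_point((0.99999848, -0.00000943, 0.00174151), z0)
third = plane_through_axis_point((0.0, 0.99998536, 0.00541231), z0)

for name, plane in (("1st", first), ("2nd", second), ("3rd", third)):
    print(name, "constant term: %.7f" % plane[3])
```

The x and y direction cosines do not enter the constant term, because the origin has x = y = 0.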

The reader must bear in mind what these three Standard Planes signify: 

The First Standard Plane is the Plane of Maximum Symmetry, or our nearest 
approach to a mirror plane, the plane which deviates least from the mid-sagittal 
points as judged by the Mean Square Residual. 

The Second Standard Plane passes through the point where the first meets the 
auricular axis, is perpendicular to the first and makes the minimum angle with the 
Frankfurt Horizontal Plane, which fails in the condition of being perpendicular to 
the First Standard Plane. Thus our Second Standard Plane may be looked upon 
as an improved “Horizontal Plane.” 

The Third Standard Plane passes through the point where the first two meet 
the auricular axis, and is perpendicular to both of them. It may therefore be looked 
upon as an improved Transverse Vertical Plane. 

The Second and Third Standard Planes will not as a rule pass through the 
auricular axis and neither will generally contain the Poria. The second will not 
usually pass through the Kappa, nor the third through the Apex. It may be not 
without interest to measure the mean departures of the Second Plane for our six skulls 
from the Poria and the Kappa, and of our Third Plane from the Poria and the Apex*.

* In the equations to the planes, as on pp. 236-238, it is needful to go to 6 or 7 decimals in order
to get values of the direction cosines which will provide angles to a decimal of a minute. The values,
being worked from those equations, are written down to the same number of decimals, but to use 
two or three decimal places in the distances is of course ample. 
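
The departures of the Poria, Kappa and Apex from a Standard Plane are perpendicular distances, obtained by substituting a point's coordinates into the plane's equation written with unit normal. A minimal sketch follows, using the Second Standard Plane of the Fuegian skull and its two Poria; the function name is hypothetical, and the sign convention (value of a x + b y + c z - p) is an assumption that happens to be the reverse of the signs quoted in the text for this skull, so only the magnitudes are to be compared.

```python
def signed_distance(plane, point):
    """Value of a*x + b*y + c*z - p; a perpendicular distance when (a, b, c) is a unit normal."""
    a, b, c, p = plane
    x, y, z = point
    return a * x + b * y + c * z - p

# Second Standard Plane of the Fuegian skull: a, b, c and constant term p.
fuegian_second = (0.9999542, 0.0001506, 0.0095692, 1.8090990)

right_porion = (0.0, 0.0, 253.8)
left_porion = (0.0, 0.0, 126.2)

d_right = signed_distance(fuegian_second, right_porion)
d_left = signed_distance(fuegian_second, left_porion)
print("R. Porion: %.7f mm" % d_right)
print("L. Porion: %.7f mm" % d_left)
```

The two Poria fall on opposite sides of the plane at roughly 0.6 mm each, which is the vertical displacement of the ears discussed in the text.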



(A) Fuegian Skull.

2nd, or Horizontal Standard Plane.

The κ is above this plane with the perpendicular upon it = -0.0004302. The
R. Porion is above the plane -0.6195640 and the L. Porion below it +0.6014660.
Thus, as judged by the Poria, the right ear is about 0.6 mm. above, and the left ear
about 0.6 mm. below the Standard Horizontal Plane.

3rd, or Transverse Vertical Standard Plane.

The Apex is +0.0085022 in front of this plane, which like the deviation of the
κ is not sensible on the skull. The R. Porion is behind this Transverse Vertical
Plane -0.9933565 and the L. Porion in front of it +0.9643337. Thus the left ear
comes horizontally forwards, and the right ear retreats.

(B) Nubian Egyptian Skull.

2nd, or Horizontal Plane.

The κ is above this plane at -0.0000522, again an insensible difference. The
R. Porion is above it at -0.3151279, and the L. Porion is below it at +0.3023893.
Again the right ear is raised above the Horizontal Plane, and the left lowered
below it, although the differences are only about 0.3 mm.

3rd, or Transverse Vertical Standard Plane.

The Apex is at -0.0064115 from this plane; the R. Porion is -0.3633504,
i.e. behind, and the L. Porion about +0.3486586, i.e. in front of it, or the Poria are
about one-third of the distance from this plane of the Poria in the case of the
Fuegian skull.

(C) Arab Skull.

2nd, or Horizontal Standard Plane.

The κ is here +0.0021754, practically at an insensible distance below the
Horizontal Plane. The R. Porion is below at +0.2417595 and the L. Porion is above
the Horizontal Plane at -0.2534915.

3rd, or Transverse Vertical Standard Plane.

The Apex is behind this plane at a distance -0.1163451. The R. Porion is
+2.5209411, i.e. behind, and the L. Porion -2.6433911, i.e. in front of the Transverse
Vertical Plane. These are the most considerable deviations we have found for any
of the six skulls.

(D) Teita Negro Skull.

2nd, or Horizontal Standard Plane.

The κ here is at a distance +0.0470995 below this plane. The R. Porion is
+1.2294295, i.e. below the Standard Horizontal, and the L. Porion is -1.2190388,
i.e. above it.

3rd, or Transverse Vertical Standard Plane.

The Apex is in front of this plane at a distance +0.0177523. The R. Porion
is behind the Standard Transverse Vertical at +0.4640688, and the L. Porion
in front of it at -0.4601417.

(E) English 17th Century Skull.

2nd, or Horizontal Standard Plane.

The κ is here -0.0178341 above the Horizontal Plane. The R. Porion is
at -0.5030299, i.e. above, and the L. Porion at +0.5153309, i.e. below it.

3rd, or Transverse Vertical Standard Plane.

The Apex is behind this plane at a distance of -0.0322620. The R. Porion is
at -0.6792383, i.e. in front, and the L. Porion at +0.6958605, i.e. behind the
Transverse Vertical Plane.

(F) Hindu Skull.

2nd, or Horizontal Standard Plane.

The κ is +0.0013340 below this plane. The R. Porion is at +0.0978892, i.e.
below, and the L. Porion at -0.0957656, i.e. above the Horizontal Plane.

3rd, or Transverse Vertical Standard Plane.

The Apex is behind this plane at -0.0042814 distance. The R. Porion is at
+0.3042197, i.e. in front, and the L. Porion is at -0.2976281, i.e. behind the
Transverse Vertical Plane.

Reviewing these results as a whole, we conclude as follows: 

Our Standard Horizontal and Transverse Vertical Planes, which we believe to be 
more reasonable than the usual Frankfurt Horizontal and Transverse Planes because 
they are at right angles to the Plane of Maximum Symmetry, do not pass through 
the auricular axis, but the first passes very nearly through the Kappa, the average 
distance in the cases of our six skulls from it being only 0.0115 mm. regardless of
sign. We have previously seen that our Standard Horizontal Plane passes, with a
mean deviation of 0.0186 mm. only, close to the Mid-porion (p. 248). The Frankfurt
Horizontal Plane also passes through these two points. If now the Frankfurt
Horizontal Plane be turned round the line joining these two points as axis, it will
pass to the position of the Standard Horizontal Plane, one Porion approaching it
and one receding from it. This provides the difference in vertical height of the two
Poria, or speaking popularly of the two ears.

The second of these planes, our Standard Transverse Vertical Plane, passes just as
closely to the Mid-porion and very close to the Apex, i.e. at an average distance of
0.0309 mm. from it. It is also perpendicular to the Frankfurt Plane. Accordingly if
the usual Transverse Vertical
Plane be rotated about the line joining Apex to Mid-porion through a small angle 
it will come nearly into the position of our Standard Transverse Vertical Plane. 
The rotation is, to judge from our six skulls only, somewhat greater than the 
rotation required to change the Frankfurt Horizontal into the Standard Horizontal 
Plane. The result of this rotation is to cause one Porion to retreat behind the 
Standard Transverse Plane and the other to advance in front of it. Popularly, one 
ear may be said with reference to the Plane of Maximum Symmetry to be farther 
back on the head than the other ear. 



This more detailed numerical investigation indicates how the ears are displaced 
vertically and horizontally with regard to homologous positions as judged from 
a good representative of the mirror plane. Of course this solely emphasises in 
a different manner the point we had already reached, namely that the auricular 
axis is not perpendicular to the Plane of Maximum Symmetry. 

The auricular axis of the skull is invaluable as an aid to the determination of
cranial planes of reference, but it is far from an essential base line when we are 
investigating the symmetry of the skull. It cannot be legitimately used in deter- 
mining standard horizontal and vertical planes, for in doing so we are assuming 
that it is necessarily perpendicular to a well-chosen Median Sagittal Plane, and this 
it is not. 

Summary . 

By aid of the Cranial Coordinatograph it has been possible to obtain the 
coordinates of any point on the skull referred to three rectangular planes of 
reference. Thence we obtain the analytical equations of solid geometry to any lines 
or planes of the skull. Discussing the “mirror plane” of an absolutely symmetrical 
skull, we were led to find as our First Standard Plane the plane of the Minimum
Square Deviation from the “mid-sagittal” points. This plane is that of maximum 
mesial symmetry. The Second and Third Standard Planes of the skull must be at 
right angles to each other and to the Plane of Maximum Symmetry. To make as 
little change as feasible we took our Second Standard Plane to make a minimum 
angle with the Frankfurt Horizontal Plane. This second plane practically passes 
through the Kappa and the Mid-porion, but is tilted to the Frankfurt Horizontal 
Plane. It is a truer Standard Horizontal Plane than the Frankfurt in that it is 
perpendicular to the close fitting Mid-sagittal Plane. Our Third Standard Plane is
perpendicular to the first two and is a truer Transverse Vertical Plane than that
through the auricular axis. These two standard planes indicate well the actual
shift of the ear-orifices forwards and backwards, upwards and downwards, being, as 
we might naturally anticipate, an effect of the general asymmetry of the skull. The 
ears, being at a maximum distance from the Mid-sagittal Plane, show more than 
any other part of the skull, or of the living head, a maximum displacement from 
a symmetrical position, and really preclude the use of the auricular axis for the 
determination of a true horizontal, or a true transverse vertical plane. 

The reader may ask: How much labour is involved in reducing a skull to 
analytical solid geometry ? Originally it took me about two hours to two hours and 
a half to determine the Poria, Kappa and Apex on a Ranke craniophor, to adjust 
the skull on the staddle and read off the coordinates of 15 to 20 points. But with 
practice I think two skulls could easily be done in a morning of less than three 
hours. The numerical work is very straightforward computing, but laborious. The 
solving of the cubic is not so lengthy as it may appear, as we can be fairly sure that 
$\Sigma_1^2$ will lie between 1 and 3 sq. mm. Because its value is so small and the other
roots of the cubic so large it is needful to keep a considerable number of decimal
places. I think that two skulls could be dealt with in 5 to 6 hours by a good
computer, so that an investigator could make a daily output of two skulls, or 10 to
12 a week. A month to five weeks would complete a racial sample of some 50 crania.
Thus 10 to 12 races could be studied in a year to eighteen months. This is less
time than many students give to their thesis for a doctorate, and a most valuable
study of the standard planes of the skull and its asymmetry would result.

Biometrika, Vol. XXV, Parts III and IV

Pearson: The Cranial Coordinatograph and the Standard Planes of the Skull

(d) Hindu Skull. Norma lateralis (Left Profile).
The white line is the trace on this skull of the customary Transverse Vertical Plane;
the horizontal black line is the trace of the Frankfurt Horizontal Plane.

Plate III

(f) Hindu Skull. Norma basalis.
The white line is the trace of the customary Transverse Vertical Plane; the horizontal black line
is the trace of the plane through the Mid-porion perpendicular to the auricular axis.

Plate IV

Plan and Elevation Model of a Fuegian Skull.

Plan and Elevation Model of an Egyptian Skull from Nubia.

Plate VII

(a) First form of Cranial Coordinatograph with Skull on Skull Staddle and auricular axis
vertical. Klaatsch and Pearson independent projectors, the former set at glabella.

Plate VIII

(ii) Symmetricised Drawing of Right Side. Drawing of Cromwell’s Death Mask. Symmetricised
Drawing of Left Side.

Plate IX

This plate indicates how much of personal character depends on asymmetry of the face.

The present paper only professes to be illustrative of what may be achieved by 
the use of a coordinatograph, and the application of solid geometry to craniometry. 

I may be over-enthusiastic, but I unhesitatingly believe that there is a most 
promising field for the craniometricians who are the first to apply Cartesian 
geometry to the skull. 



A STUDY OF TWELFTH AND THIRTEENTH DYNASTY 
SKULLS FROM KERMA (NUBIA). 

By MARGOT COLLETT. 

(Crewdson Benington Student.) 

(1) The Discovery of the Skulls at Kerma. The skull series forming the subject
of the present paper came from Kerma, a place lying on the east bank of the Nile 
between Argo and Tombos (see Fig. I). Kerma is 150 miles south of the boundary 
which divides the Anglo-Egyptian Sudan from Egypt and as the crow flies it is 
almost as far from Kerma to Thebes, a distance of nearly 450 miles, as it is from 
Thebes to Alexandria. The excavations were carried out by Dr George A Reisner 
leading the joint expedition of Harvard University and the Boston Museum of 
Fine Arts in the seasons 1913-14 and 1915-16, and the following particulars are 
taken from his report*. No certain date for the occupation of Kerma is given 
earlier than the Twelfth Dynasty, but objects of earlier date have been found there 
and the settlement is believed to be one of great antiquity. The ruins of mud 
houses discovered under the foundations of buildings of Middle Dynastic date show 
clearly that there had been a settlement of considerable size in earlier times. Taking 
into consideration the local history of other parts of Nubia, it is most probable that 
during the Old Kingdom (1st — 8th Dynasties) and the Early Middle Kingdom 
(9th — 11th Dynasties) Kerma was one of the numerous Egyptian trading stations 
held by an agent and a few men, possibly local natives, at which the periodic
expeditions from Egypt called to deliver and collect goods. These expeditions
were sent out every two or three years carrying ointment, honey, faience and
cloth, which were taken in exchange for resin, ivory, woods, oils, special grain, 
incense and leopard skins. The discovery of statuettes and pottery on which were 
inscribed the names of Pepy I and II, who were kings of the Sixth Dynasty, and 
of Amenemhat I of the Twelfth Dynasty, gave the first real evidence for purposes of 
dating. It is suggested that the settlement was slightly increased in the reign of 
Amenemhat I after the quelling of a native revolt. Dr Reisner gives 1970 B.C.
as the date of this revolt. Later, in the reign of Sesostris I (also of the Twelfth 
Dynasty), and probably just after another native rising, a larger force was sent out 
from Egypt under the governorship of Hepzefa who was probably responsible for 
the erection of a fortress or fortified residence known as the Western Deffûfa. This
building was the nucleus of a military settlement. It was of considerable size,
containing a guard-room and several other rooms, some of which may have been
used as storehouses. It is thought that the Deffûfa was built principally as a stronghold
for the protection of goods brought from Upper Egypt for purposes of exchange,

* “Excavations at Kerma,” Vol. I (Parts I–III) and Vol. II (Parts IV–V). Harvard African Studies,
Vols. V and VI (1923).





and of the taxes collected from the local tribes. Owing to the lack of water nearby 
there would not have been a convenient harbourage for a large armed force in the 
building. 



This governor (Hepzefa), besides being the builder of the Deffûfa, is believed
to have been the founder of the Egyptian Cemetery in which we are chiefly 
interested. It remained in use for over 350 years. In the neighbourhood of the 






Deffûfa were remains of only two other buildings, both being funerary chapels, one
of which was attached to the earliest tumulus (K III) and the other to one of later
date (K X). The pottery and grave furniture throughout this cemetery, the painting
in the second chapel, the seals, and the numerous statuettes were all distinctly
Egyptian in form and technique. On the other hand there were certain peculiarities
quite unknown in the contemporary graves in Egypt proper. The principal among
these peculiarities was the sacrificial, or so-called sati, burial, when in some cases
as many as 320 people appear to have been buried alive with the body of their 
chief. This custom was prevalent in Egypt during Predynastic and First Dynastic 
times; it was also common in the earliest Nubian graves and it appears to have 
been the custom at Ur in early times. It was not only practised by the ruling 
classes at Kerma ; the smaller independent graves of quite poor type also have one 
or possibly two human sacrifices. In the later graves, where the Nubian element 
became more marked, a ram or several rams seem to have been substituted for the 
human sacrifices in some cases, or there might be three rams and two human beings 
as opposed to one ram and four human beings in an earlier grave. The covering 
of the body with a cowhide shows Egyptian influence, but again the custom of bed 
burials was unknown in Upper Egypt at this time. In one of the later tumuli 
coffin burials were introduced contemporaneously with bed burials, another sign of 
Egyptian influence. The pottery became noticeably coarser in the later graves and, 
although developing along the same lines as in Egypt proper, a certain degenera- 
tion is evident. 

The cemeteries at Kerma can be divided into three groups : the earliest Egyptian 
Cemetery founded by Hepzefa, the Nubian Cemetery which follows a transition 
period of Egyptian-Nubian graves and the much more recent Third and Fourth 
Century A.D. Meroitic Cemetery. With the exception of a few skulls from the
Meroitic Cemetery which have not been measured all our material comes from the 
earliest Egyptian Cemetery which was in use during the Twelfth and Thirteenth 
Dynasties. The following table gives the numbers of skulls of which measurements 
were taken belonging to different graves or tumuli, including the Meroitic skulls: 


No. of Tumulus     No. of Skulls            Dynasty                   Nature of
or Grave           Main      Subsidiary                               Interment

III                11        22             12th                      Tumulus
IV                 13        42             "                         "
X                  48        56             13th                      "
XVI                4         16             "                         "
XVIII              11        6              "                         "
XXIX               1                        "                         Minor tumulus
XX                 2                        "                         "
XXIII              10                       "                         Tumulus
XXXIV              1         -              "                         Minor tumulus
XXXV               1                        "                         "
B                  9                        12th                      Graves in open
Meroitic           4         -              3rd and 4th cent. A.D.







The tumuli throughout the cemeteries were all constructed on the same plan. 
They consisted of one or two main chambers centrally placed, one of which con- 
tained the chief body, a sacrificial corridor dividing the tumulus in half, and built 
at right angles to this corridor a series of parallel walls gradually decreasing in 
height towards the outside. The subsidiary graves were placed between these 
walls and they are thought to have held the bodies of the officials belonging to 
the court of the chief. The height of the walls where they met the walls of the 
corridor was about 3 ft. and it decreased to about 1 ft. on the outside edge. The 
whole erection was covered with a mound of earth and surrounded by a ring of ox 
skulls which were possibly the remains of a funeral feast, and also by a ring of 
black stones for which we have no explanation. In addition to these tumuli, which 
differed in size considerably, there were also several independent graves consisting 
of oval or rectangular pits and containing one, two or three bodies each. Both 
primary and sacrificial burials were made in each form of funerary chamber or 
grave and it is not possible from the records available to distinguish the two 
varieties in all cases. It is probable, however, that considerably more than half of 
the skulls dealt with were those of sacrificial victims. The tumuli were numbered
in chronological order as far as this was possible. The actual skeletons from the 
central corridor, representing the chief body and the sacrificial bodies, were lettered, 
and the skeletons from the subsidiary graves numbered. Of the total 310 skulls 
available 55 were excluded from the series measured for various reasons and this 
accounts for breaks in the serial numbering which was done in the Biometric 
Laboratory. The present paper deals only with the skulls, though many of them 
have mandibles and considerable numbers of the other bones of the skeleton have 
been preserved. 

The archaeological evidence was apparently only capable of giving a very 
incomplete history of the settlement at Kerma, and the account of the excavations 
throws little or no light on many important points. We may conclude, however, 
that the original trading colony, which was founded at an early but unknown date, 
almost certainly consisted of pure Egyptians, although there was an early non- 
Egyptian village of considerable size and also a fairly large Nubian village in the 
vicinity. The trading settlement Inebuw Amenemhat (Kerma) was known to be 
in existence in the Twelfth Dynasty and evidence of a settlement much earlier 
than this is given. Reisner says that “the colony at Inebuw Amenemhat was... 
on the site of an old trading post of the 6th Dynasty.” After the advent of Hepzefa
the number of colonists was not substantially, if at all, increased by the traders
from Egypt as far as we know, although there is no definite evidence on this 
point. Supposing there was no increase by immigration, the occupants must have 
interbred and remained in possession after the Twelfth Dynasty. The tenure was 
a military one and it is unlikely that there was much intermarriage with native 
peoples of the locality. The culture, as we have seen, remained essentially Egyptian 
in type until a much later period than that with which we are concerned, and the 
fact that a few peculiarities were observed such as the common practice of sacrificial
burial and the use of a bed in burial cannot be accepted as evidence of blood 




admixture with any foreign element. It would appear that all these peculiarities 
were merely survivals of elements of an earlier Egyptian culture. The bulk of our 
material belongs to a comparatively short period, probably not exceeding 250 years. 
Reisner says: “For the period from the death of Hepzefa (K III) to that of the
person buried in K XX I estimate an interval of about 200 years.”

It is fortunate that our knowledge of the craniology of Upper Egypt from Early 
Predynastic to Roman times is quite extensive, since the original colonists almost 
certainly came from there. It is known that the physical type in the north was 
undergoing slight but continual modification, and we may hope to give a definite 
answer to the questions whether the community which was apparently isolated in 
Nubia was, during the period for which we have acquaintance with it, of the same 
physical type as the contemporary population of Upper Egypt, or whether it was 
of the same type as the Upper Egyptian population of another period, or whether 
it had acquired distinctive characters which would differentiate it from all known 
populations of Egypt. 

(2) The Nature of the Kerma Series. There are 310 skulls from Kerma pre- 
served in the Biometric Laboratory. Fifty-five of these were excluded from the 
series, and no measurements of them were taken, for various reasons : 31 are 
posthumously distorted, eight are too fragmentary to be measured, one is scapho- 
cephalic, seven are immature, four are of Meroitic date, two found in tumuli are 
so well preserved that they can only be supposed to represent modern intrusive 
burials and two others possess manifest negro characters. The remaining skulls, 
which are the only ones considered in later sections of this paper, are all in a 
similar state of preservation, which resembles that of most other Predynastic and 
Dynastic Egyptian series, and most of them are complete or nearly complete. With 
two exceptions, they all bear grave numbers, which assign them to the Twelfth 
and Thirteenth Dynasties, and these are given with the corresponding serial 
numbers in the appended tables of individual measurements. The specimens which
were not marked by the excavators (Nos. 224 and 272) are presumed to belong to 
the same period as the others. When the skulls were set out on a table so that all 
could be seen together, few striking differences in type were observed except in 
the case of the two with clear negro characters which were excluded from the 
series on this account. It was evident that if any division of the remainder was 
to be made by merely examining the skulls, such would have to depend on the 
fact that some possess more negroid characters — viz. a flatter nasal bridge, a higher 
nasal index and a greater degree of prognathism — than others. But an attempt 
to divide the whole into two contrasted groups showed that it was quite impossible 
to distinguish the negroid and non-negroid specimens with any degree of exact- 
ness. Hence it was concluded that the safest procedure was to treat the total series 
as if it represented a single racial type which would obviously be one possessing 
negroid characters. The series did not appear to be more heterogeneous than 
many which have been dealt with in this way. 
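The counts quoted in this section can be cross-checked with a few lines of arithmetic. The sketch below simply tallies the reasons for exclusion as listed in the text; the dictionary keys and variable names are illustrative labels, not terms taken from the laboratory records.

```python
# Numbers of excluded skulls, as listed in the text of section (2).
excluded = {
    "posthumously distorted": 31,
    "too fragmentary": 8,
    "scaphocephalic": 1,
    "immature": 7,
    "Meroitic date": 4,
    "modern intrusive burials": 2,
    "manifest negro characters": 2,
}

total_preserved = 310                         # skulls preserved in the Laboratory
n_excluded = sum(excluded.values())           # should agree with "fifty-five"
n_series = total_preserved - n_excluded       # skulls actually measured

print(n_excluded, n_series)
```

The two totals agree with the fifty-five exclusions and the 255 adult skulls that form the measured series.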

The sexing of the 255 adult skulls forming the series was done by Professor 





Karl Pearson and Dr G. M. Morant. They distinguished 141 males and 114 females.
This slight excess of the representatives of the former sex over those of the latter 
has frequently been found for a collection of excavated crania, and it has been 
supposed due to the fact that the weaker female specimens would be broken more 
easily and hence less likely to be saved by the excavators. In the present case 
many of the skulls are known to be those of sacrificial victims and this may also
account in part for the presumed disparity between the sexes. Sex ratios, i.e. male
means divided by female means, have been given by Kitson* for two European,
two negro and the long Egyptian E series of 26th–30th Dynasty skulls. The
Kerma values are closest to those for the Egyptians and Bantu negroes (Teita)
from Kenya Colony and comparisons with these are made in our Table I. The
correspondence of the sex ratios for these three series is extremely close and we 
may conclude that they were all sexed in a uniform way. 

TABLE I.

Sex Ratios for the Kerma and other Series.

Character    Kerma    Egyptian E    Teita

L            1.045    1.047         1.046
B            1.029    1.025         1.024
LB           1.054    1.053         1.051
B'           1.040    1.027         1.039
H            1.058    1.038         1.044
S            1.039    1.034         1.037
U            1.044    1.038         1.037
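As a check on the Kerma column, the sex ratio is simply the male mean divided by the female mean. The following sketch recomputes it from the rounded means printed in Table II; the dictionary layout and the name `sex_ratios` are illustrative choices only.

```python
# Male and female means (mm) for the Kerma series, as printed in Table II.
means = {
    "L":  (185.2, 177.2),
    "B":  (133.6, 129.8),
    "LB": (101.8, 96.6),
    "B'": (92.8, 89.2),
    "H":  (135.7, 128.2),
    "S":  (374.7, 360.3),
    "U":  (510.2, 488.9),
}

# Sex ratio = male mean / female mean, rounded to three decimals as in Table I.
sex_ratios = {ch: round(m / f, 3) for ch, (m, f) in means.items()}

for ch, ratio in sex_ratios.items():
    print(f"{ch:3s} {ratio:.3f}")
```

For L, B, LB, B' and U the recomputed ratios agree with Table I. For H and S they come out one unit higher in the third decimal place, which is what would be expected if the published ratios were formed from means carried to more decimals than are printed.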


A rough estimate of the age constitution of the cemetery population repre- 
sented can be obtained by considering the state of closure of the coronal, sagittal 
and lambdoid sutures. The results for the Kerma series are summarised below, 
those three calvarial sutures being the only ones referred to. 


Sex    All sutures open    Sutures beginning to      All sutures closed    Total
                           close or partly closed

♂      16 (11.3%)          83 (58.9%)                42 (29.8%)            141
♀      40 (35.1%)          51 (44.7%)                23 (20.2%)            114
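The percentages in this summary follow directly from the counts and the sex totals, and recomputing them is a useful guard against misread digits. A minimal sketch, with illustrative names:

```python
# Counts of skulls by state of closure of the three calvarial sutures.
# Order: all open, beginning to close or partly closed, all closed.
counts = {
    "male":   [16, 83, 42],
    "female": [40, 51, 23],
}

percentages = {}
for sex, row in counts.items():
    total = sum(row)
    # Percentage of each closure state within the sex, to one decimal place.
    percentages[sex] = [round(100 * c / total, 1) for c in row]
    print(sex, total, percentages[sex])
```

The counts sum to 141 males and 114 females, and the middle male and first female percentages come out at 58.9 and 35.1 respectively.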


The percentages may be compared with those given† for six other series
examined by Dr Morant. For every one of these marked differences were found
between the corresponding male and female frequencies, confirming the fact that
the sutures close at a later age for women than for men. The Kerma male
percentage of skulls showing all three sutures open is lower than any previously 
given, and the percentage with all sutures closed is only exceeded by one other. 
The females show a smaller proportion with all sutures open and a larger with 
all sutures closed than has been found for any other series examined. The present 
* Biometrika, Vol. XXIII (1931), p. 275. † Biometrika, Vol. XXIV (1932), p. 170.






sample must hence be supposed to represent an older cemetery population than 
any of the others, although the contrary would have been expected owing to the 
fact that many of the people buried at Kerma met unnatural deaths as they were 
sacrificial victims. 

We may enquire next whether there is any evidence of a change in the racial 
constitution of the population of Kerma during the Twelfth and Thirteenth 
Dynasties. The male means were computed for each of these Dynasties and the 
crude coefficient of racial likeness of 0.92 ± .17 for 31 characters was found between
them. No significant differences between the means were found. The coefficient 
differs significantly from zero, but it may be too high owing to the fact that the 
Egyptian E standard deviations were used in computing it in place of the standard 
deviations for each of the samples compared. In any case it is of such a low order 
that it was thought safest to conclude that there is no evidence of a change in the 
racial nature of the Egyptian settlers at Kerma during the period considered. 
The female coefficient was not calculated, but a few of the more important means 
for the two Dynasties are compared in the table below. 


Character        Male                            Female
                 Twelfth       Thirteenth        Twelfth       Thirteenth
                 Dynasty       Dynasty           Dynasty       Dynasty

100 B/L          71.8 (59)     72.6 (74)         73.1 (37)     73.4 (73)
100 B/H'         101.0 (41)    100.2 (50)        101.8 (27)    101.9 (59)
100 NB/NH, R     51.5 (48)     51.6 (63)         53.0 (28)     53.4 (55)
N∠               64.3° (45)    65.4° (50)        67.8° (21)    67.4° (42)
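The paper does not restate the formula for the crude coefficient of racial likeness quoted above. The sketch below assumes the form usually attributed to Pearson, C = (1/M) Σ [n n'/(n + n')] (m − m')²/σ² − 1, with probable error 0.67449 √(2/M), where M is the number of characters. The function name and the input layout are illustrative, and the three rows used in the example take their standard deviations from the pooled Kerma males of Table II rather than from the Egyptian E series used in the paper, so the result is not expected to reproduce the published 0.92.

```python
from math import sqrt

def crude_crl(characters):
    """Crude coefficient of racial likeness (assumed Pearson form).

    characters: list of (mean_1, n_1, mean_2, n_2, sd) tuples, one per
    character, where sd is the standard deviation adopted for that character.
    Returns (coefficient, probable_error).
    """
    m = len(characters)
    total = sum(
        (n1 * n2 / (n1 + n2)) * ((a - b) / sd) ** 2
        for a, n1, b, n2, sd in characters
    )
    return total / m - 1.0, 0.67449 * sqrt(2.0 / m)

# Illustrative input only: male Twelfth v. Thirteenth Dynasty means from the
# table above, with pooled Kerma standard deviations (an assumption).
example = [
    (71.8, 59, 72.6, 74, 3.38),    # 100 B/L
    (101.0, 41, 100.2, 50, 4.34),  # 100 B/H'
    (51.5, 48, 51.6, 63, 3.94),    # 100 NB/NH, R
]
coefficient, probable_error = crude_crl(example)
print(round(coefficient, 2), round(probable_error, 2))
```

Under this form the probable error depends only on the number of characters; for 31 characters it is 0.67449 √(2/31), or about .17, which matches the value attached to the published coefficient.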


The differences between these means in the case of each sex are quite in- 
significant. All the Twelfth and Thirteenth Dynasty skulls from Kerma accordingly 
were pooled, and the means, standard deviations and coefficients of variation with 
their probable errors for the total sample are given in Table II for all the characters 
measured and for the indices and angles*. I was able to make use of a considerable 

* [Definitions of the measurements, which are denoted by the usual index letters, will be found in
Biometrika, Vol. XXB (1928), pp. 362–364. A single change was made in the way the nasal height NH'
was measured. In recent craniometric studies published in Biometrika this has been taken, in addition
to the Frankfurt nasal heights, from the nasion to the “base” of the anterior nasal spine with the object
of providing a measurement which was supposed the same as the one defined by Broca and Martin and
again in the Monaco scheme. While the definitions provided in these techniques are not perfectly clear,
a re-examination suggested that a closer approach to the nasal height actually taken by those who have
followed them could be obtained in the following way. The lower margins of the pyriform aperture are
first drawn as pencil lines and the Frankfurt heights are taken to these curved lines on either side.
A third pencil line, which will be horizontal on a symmetrical specimen, is then drawn so that it
appears, when the skull is viewed in norma facialis, to be a straight line joining the lowest points of
the margins on either side. The point where this horizontal line meets the boundary of the anterior
nasal spine (itself a vaguely defined line) on the left-hand side is supposed to be Martin’s nasospinale.
The NH' given for the Kerma skulls is the chord from this point to the nasion, and in all cases it was
found to be very close to the Frankfurt NH, L. The NH's given by Miss Kitson for the Naga skulls
in the present volume of Biometrika were taken in the same way, but in earlier studies in the Biometric
Laboratory the NH' was to the “base” of the anterior nasal spine. G. M. M.]






number of measurements which had previously been taken by Mr G. C. Dunning 
and the individual measurements are given in Tables VIII, IX. The nature of the 
sample can be estimated from the constants in Table II. It may be asked first 


TABLE II.

Constants of the Male and Female Kerma Series.

Character          Means                                     Standard Deviations             Coefficients of Variation
                   Male                 Female               Male            Female          Male            Female

C                  1383.3 ± 8.2 (72)    1229.9 ± 7.5 (47)    103.5 ± 5.8     86.8 ± 5.3      7.48 ± .39      7.06 ± .42
F                  183.3 ± .38 (138)    177.0 ± .36 (114)    6.63 ± .27      5.79 ± .27      3.62 ± .16      3.27 ± .15
L                  185.2 ± .36 (138)    177.2 ± .35 (114)    6.26 ± .25      5.68 ± .25      3.38 ± .14      3.20 ± .14
B                  133.6 ± .26 (136)    129.8 ± .28 (112)    4.52 ± .18      4.37 ± .20      3.38 ± .14      3.37 ± .15
B'                 92.8 ± .23 (139)     89.2 ± .27 (111)     4.08 ± .16      4.20 ± .19      4.40 ± .19      4.71 ± .21
B"                 113.1 ± .34 (68)     107.6 ± .42 (60)     4.13 ± .24      4.85 ± .30      3.65 ± .21      4.51 ± .28
Biasterionic B     104.9 ± .29 (113)    100.2 ± .24 (102)    4.51 ± .20      3.60 ± .17      4.26 ± .19      3.59 ± .17
H'                 133.4 ± .32 (93)     127.8 ± .31 (88)     4.65 ± .23      4.38 ± .22      3.48 ± .17      3.43 ± .17
H                  135.7 ± .34 (100)    128.2 ± .35 (77)     5.01 ± .24      4.56 ± .25      3.69 ± .18      3.56 ± .19
OH                 110.7 ± .26 (125)    107.4 ± .25 (91)     4.25 ± .18      3.48 ± .17      3.84 ± .16      3.24 ± .16
LB                 101.8 ± .26 (117)    96.6 ± .27 (89)      4.16 ± .18      3.75 ± .19      4.09 ± .22      3.88 ± .19
S1'                112.1 ± .30 (126)    107.4 ± .29 (109)    4.96 ± .21      4.53 ± .21      4.42 ± .19      4.22 ± .19
S2'                115.1 ± .34 (128)    111.0 ± .35 (106)    5.78 ± .24      5.26 ± .24      5.02 ± .22      4.74 ± .22
S3'                97.2 ± .32 (124)     94.3 ± .31 (97)      5.34 ± .23      4.58 ± .22      5.49 ± .24      4.86 ± .23
S1                 128.1 ± .38 (126)    123.3 ± .39 (109)    6.28 ± .27      6.05 ± .28      4.90 ± .21      4.91 ± .22
S2                 128.7 ± .44 (128)    124.0 ± .46 (106)    7.46 ± .31      7.06 ± .33      5.80 ± .25      5.69 ± .23
S3                 116.8 ± .46 (124)    112.9 ± .46 (97)     7.67 ± .33      6.71 ± .32      6.57 ± .28      5.94 ± .29
S                  374.7 ± .77 (133)    360.3 ± .80 (105)    13.15 ± .54     12.13 ± .56     3.51 ± .15      3.37 ± .16
U                  510.2 ± .81 (133)    488.9 ± .83 (107)    13.89 ± .57     12.78 ± .54     2.72 ± .09      2.61 ± .12
Q'                 305.0 ± .54 (123)    291.8 ± .60 (89)     8.98 ± .37      8.33 ± .42      2.94 ± .13      2.86 ± .14
fml                36.4 ± .20 (112)     34.6 ± .15 (92)      3.13 ± .14      2.14 ± .11      8.60 ± .39      6.18 ± .30
fmb                30.1 ± .16 (110)     28.4 ± .16 (83)      2.42 ± .11      2.12 ± .11      8.04 ± .37      7.46 ± .39
G'H                69.6 ± .29 (111)     65.7 ± .25 (83)      4.48 ± .20      3.33 ± .17      6.45 ± .29      5.07 ± .27
GL                 96.5 ± .32 (97)      93.7 ± .38 (68)      4.75 ± .23      4.63 ± .27      4.92 ± .24      4.94 ± .29
GB                 95.3 ± .28 (111)     90.9 ± .30 (87)      4.64 ± .21      4.20 ± .21      4.87 ± .22      4.62 ± .24
J                  127.5 ± .38 (83)     118.9 ± .39 (52)     5.16 ± .27      4.16 ± .28      4.05 ± .24      3.50 ± .23
NH, R              50.0 ± .18 (112)     47.2 ± .20 (86)      2.82 ± .14      2.79 ± .14      5.64 ± .25      5.91 ± .30
NH, L              50.1 ± .19 (112)     47.3 ± .20 (86)      3.05 ± .14      2.70 ± .14      6.09 ± .28      5.71 ± .30
NH'                50.0 ± .19 (105)     47.1 ± .19 (90)      2.95 ± .15      2.73 ± .14      5.90 ± .27      5.80 ± .29
NB                 25.8 ± .11 (114)     25.0 ± .15 (89)      1.79 ± .08      2.13 ± .11      6.94 ± .31      8.52 ± .43
DS                 11.6 (32)            11.1 (22)            -               -               -               -
DC                 22.7 (33)            22.2 (23)            -               -               -               -
DA                 34.5 (32)            33.0 (22)            -               -               -               -
SS                 4.0 ± .09 (96)       3.1 ± .09 (69)       1.38 ± .07      1.12 ± .06      34.60 ± 1.88    36.13 ± 2.33
SC                 10.8 ± .13 (95)      10.2 ± .17 (72)      1.99 ± .10      2.15 ± .12      18.43 ± .93     21.08 ± 1.28
G1                 52.0 ± .28 (87)      49.7 ± .26 (59)      3.82 ± .14      3.01 ± .19      7.35 ± .38      6.06 ± .38
G1'                47.3 ± .21 (88)      45.7 ± .24 (62)      2.94 ± .15      2.84 ± .17      6.22 ± .32      6.21 ± .38
G2                 39.4 ± .22 (82)      37.8 ± .23 (62)      2.90 ± .15      2.71 ± .16      7.36 ± .39      7.17 ± .44
EH                 13.2 ± .18 (55)      12.3 ± .18 (47)      2.03 ± .13      1.85 ± .13      15.38 ± 1.01    15.04 ± 1.07
O1, R              41.6 ± .11 (116)     39.7 ± .12 (80)      1.77 ± .08      1.60 ± .08      4.25 ± .19      4.03 ± .21
O1, L              41.5 ± .11 (105)     39.7 ± .11 (84)      1.73 ± .08      1.44 ± .07      4.17 ± .19      3.63 ± .19
O1', R             39.4 ± .19 (46)      37.7 ± .14 (47)      1.91 ± .13      1.47 ± .10      4.85 ± .35      3.90 ± .27
Lacrymal O1, R     38.2 (28)            36.6 (40)            -               -               -               -
O2, R              32.7 ± .13 (114)     32.5 ± .13 (83)      2.06 ± .08      1.89 ± .09      6.30 ± .28      5.81 ± .30
O2, L              32.7 ± .14 (103)     32.6 ± .14 (85)      2.06 ± .10      1.90 ± .09      6.30 ± .30      5.84 ± .30










TABLE II (continued).

Character             Means                                      Standard Deviations
                      Male                 Female                Male            Female

100 B/L               72.2 ± .20 (133)     73.3 ± .19 (112)      3.38 ± .14      3.01 ± .14
100 H'/L              72.4 ± .19 (91)      72.4 ± .17 (88)       2.67 ± .13      2.37 ± .12
100 H/L               73.2 ± .18 (100)     72.9 ± .18 (77)       2.61 ± .12      2.33 ± .13
100 B/H'              100.6 ± .31 (91)     101.9 ± .32 (88)      4.34 ± .22      4.50 ± .23
100 B/H               99.0 ± .30 (98)      101.6 ± .34 (77)      4.42 ± .21      4.44 ± .24
100 (B-H')/L          0.5 ± .23 (89)       1.3 ± .24 (88)        3.05 ± .15      3.29 ± .17
100 G'H/GB            72.7 ± .33 (104)     72.2 ± .33 (79)       5.00 ± .23      4.32 ± .23
100 NB/NH, R          51.6 ± .25 (111)     53.2 ± .35 (84)       3.94 ± .18      4.72 ± .25
100 NB/NH, L          51.6 ± .25 (111)     53.3 ± .34 (85)       3.95 ± .18      4.78 ± .24
100 NB/NH'            51.8 ± .27 (105)     53.4 ± .36 (85)       4.16 ± .19      4.87 ± .25
100 SS/SC             37.3 ± .78 (95)      30.5 ± .72 (69)       11.22 ± .55     8.85 ± .51
100 DS/DC             51.3 (32)            50.7 (22)             -               -
100 G2/G1             75.7 ± .53 (67)      75.5 ± .65 (42)       6.39 ± .37      6.26 ± .46
100 G2/G1'            83.6 ± .58 (67)      82.5 ± .71 (44)       7.04 ± .41      7.00 ± .50
100 EH/G2             33.3 ± .53 (55)      32.8 ± .52 (47)       5.81 ± .37      5.29 ± .37
100 O2/O1, R          78.6 ± .32 (113)     81.6 ± .33 (83)       5.06 ± .23      4.49 ± .24
100 O2/O1, L          78.9 ± .32 (104)     81.8 ± .33 (84)       4.85 ± .23      4.44 ± .23
100 O2/O1', R         81.6 ± .52 (44)      84.9 ± .47 (46)       5.14 ± .37      4.74 ± .33
100 O2/Lacr. O1, R    84.0 (28)            87.7 ± .52 (40)       -               4.84 ± .36
100 fmb/fml           83.2 ± .41 (106)     82.0 ± .39 (81)       6.23 ± .29      5.92 ± .31
Oc. I.                60.4 ± .15 (124)     60.6 ± .16 (97)       2.50 ± .11      2.37 ± .11
N∠                    64.9° ± .21 (95)     67.6° ± .30 (63)      3.08° ± .15     3.52° ± .21
A∠                    74.1° ± .24 (95)     72.1° ± .27 (63)      3.52° ± .17     3.22° ± .19
B∠                    41.0° ± .20 (95)     40.3° ± .18 (63)      2.85° ± .14     2.15° ± .13
Alveolar P∠           83.8° ± .27 (69)     83.6° ± .45 (54)      3.38° ± .19     4.96° ± .32
Prosthion P∠          81.8° ± .27 (88)     81.8° ± .46 (59)      3.73° ± .19     5.21° ± .32
θ1∠                   31.0° ± .22 (62)     28.1° ± .52 (45)      2.62° ± .16     5.16° ± .37
θ2∠                   10.4° ± .27 (62)     12.3° ± .53 (45)      3.11° ± .19     5.25° ± .37

whether the male and female series represent the same racial type. The differences
of the corresponding mean indices and angles for the two sexes are found to exceed
three times their probable errors in the case of 100 B/L (Δ/p.e. Δ = 4.0), 100 SS/SC
(6.4), 100 O2/O1, R (6.5), 100 O2/O1, L (6.3), 100 O2/O1', R (4.8), 100 B/H (5.7),
100 NB/NH, R (3.7), 100 NB/NH, L (4.0), 100 NB/NH' (3.6), N∠ (7.4), A∠ (5.5),
θ1∠ (5.1) and θ2∠ (3.2). Sexual differences of the same sign as those now found
are to be expected in the case of the first five of these indices. The difference in
the case of 100 B/H may be due to chance causes as that for the very similar
index 100 B/H' is insignificant. The other measurements selected in this way are
the nasal indices, which all measure the same character in slightly different ways, 
and four angles which are necessarily related. They simply show that the female 
type has a significantly higher nasal index and a greater degree of prognathism 
(judging by the nasal but not by the profile angles) than the male*. This might 
suggest that there was more negro blood among the women than among the men 

* These relations are still true if the Twelfth and Thirteenth Dynasties are considered separately, as
is shown by the table on p. 260. But the sexual differences between the corresponding means there are 
probably all insignificant. 





of Kerma, but it would be rash to accept such a conclusion without further evidence, 
and comparisons of a different kind made below (p. 272) fail entirely to sub- 
stantiate it. The fact that the majority of the measurements of shape show either 
insignificant differences, or differences which would have been expected, must be 
taken to indicate that the male and female samples really represent the same 
racial type. 
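The ratios Δ/p.e. Δ quoted in this discussion can be recovered from the means and probable errors of Table II, on the assumption (usual at the time, but not stated in the paper) that the probable error of a difference between two independent means is the root of the sum of the squared probable errors. The function name below is illustrative.

```python
from math import sqrt

def diff_over_pe(mean_1, pe_1, mean_2, pe_2):
    """Absolute difference of two means divided by the probable error of the
    difference, treating the two samples as independent (an assumption)."""
    return abs(mean_1 - mean_2) / sqrt(pe_1 ** 2 + pe_2 ** 2)

# (male mean, male p.e., female mean, female p.e.) as printed in Table II.
rows = {
    "100 B/L":   (72.2, 0.20, 73.3, 0.19),
    "100 SS/SC": (37.3, 0.78, 30.5, 0.72),
    "100 B/H":   (99.0, 0.30, 101.6, 0.34),
    "N angle":   (64.9, 0.21, 67.6, 0.30),
}

ratios = {name: round(diff_over_pe(*vals), 1) for name, vals in rows.items()}
for name, value in ratios.items():
    # A ratio above 3 is the criterion of significance used in the text.
    print(f"{name:10s} {value:.1f}  significant={value > 3}")
```

For these four characters the recomputed ratios agree with the values given in the text (4.0, 6.4, 5.7 and 7.4).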

Sexual differences in variability may be considered next and comparisons were 
made between the coefficients of variation of the absolute measurements and the 
standard deviations of the indices and angles. For these constants 48 male values 
exceed the corresponding female values and for the remaining 19 the female are 
in excess. For the long Egyptian E series the male variabilities are the greater in 
46 cases and the female in the other five, but a proportion more similar to that 
found for the present series has been observed for most other long cranial series 
available. The only significant difference found for the Kerma male and female 
coefficients of variation is for fml (Δ/p.e. Δ = 4.9) and for the standard deviations
of the indices and angles the only significant differences (Δ/p.e. Δ > 3) are in the
case of the Alveolar P∠ (4.2), Prosthion P∠ (4.0), θ1∠ (6.0) and θ2∠ (4.7).
Finally we may compare the Kerma variabilities with those given for other series 
and it will suffice to consider only the Egyptian E series in this connection. It is 
known to be more homogeneous than almost all others that have been measured. 
Still comparing the coefficients of variation of absolute measurements and the 
standard deviations of indices and angles, it is found that 31 of the male Kerma 
variabilities are greater than the Egyptian E and 15 less, while four of the 
differences exceed three times their probable errors; for the females the Kerma 
variabilities are the greater in 26 cases, the lesser in 19 and there is equality in 
one case, while 10 of the differences may be considered significant. The Kerma 
series is thus rather more variable than the Egyptian E , but it is as homogeneous 
as many which have to be accepted as representing single and indivisible racial 
types. 
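The constants compared in this paragraph are related by simple formulas which the paper takes for granted. The sketch below assumes the usual probable-error expressions of the period (0.6745 σ/√n for a mean, 0.6745 σ/√(2n) for a standard deviation, and the corresponding expression for a coefficient of variation V = 100 σ/mean); the function name and dictionary keys are illustrative. It is applied to the male maximum breadth B of Table II (mean 133.6, standard deviation 4.52, 136 skulls).

```python
from math import sqrt

PE_FACTOR = 0.6745  # probable error = 0.6745 x standard error

def series_constants(mean, sd, n):
    """Coefficient of variation and probable errors for one character,
    using the conventional formulas (assumed, not quoted from the paper)."""
    cv = 100.0 * sd / mean
    return {
        "pe_mean": PE_FACTOR * sd / sqrt(n),
        "pe_sd": PE_FACTOR * sd / sqrt(2 * n),
        "cv": cv,
        "pe_cv": PE_FACTOR * cv / sqrt(2 * n) * sqrt(1 + 2 * (cv / 100.0) ** 2),
    }

male_b = series_constants(mean=133.6, sd=4.52, n=136)
print({key: round(value, 2) for key, value in male_b.items()})
```

To two decimals the results are .26, .18, 3.38 and .14, which are the values printed for B in the male columns of Table II.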

(3) Remarks on the Condition and Anomalies of the Kerma Skulls. The crania 
from Kerma were examined for anomalies after the method customarily employed 
in the Biometric Laboratory in recent years. There are totals of 141 male and 
114 female specimens*, but owing to the incomplete nature of several of the skulls 
the totals which might have been affected are recorded separately for several of 
the more important anomalies. 

(a) Sutures. Remarks on the condition of the coronal, sagittal and lambdoid
sutures for each skull are given in the appended tables of individual measure- 
ments. Unless otherwise stated it may be assumed that any one, or all, of these 
sutures are open for their entire lengths. The approximate estimate of the age 
constitution of the sample which can be deduced from these data has been con- 
sidered above. Another point of interest is the order of closure of the three 

* An adult female skull (No. 254) is not included in this total and it was not measured since it
appears to be scaphocephalic. The sagittal suture is completely obliterated, the coronal is closing and
the lambdoid clearly open for its whole length.




principal sutures since a clear racial difference is supposed to be shown in this 
respect. The following table gives the frequencies found for the different orders of 
closure : 


Order of closure                                              ♂              ♀

Sagittal closing before coronal and lambdoid                  75             34
    Coronal closing before lambdoid                           [illegible]    [illegible]
    Lambdoid closing before coronal                           20             10
Sagittal and coronal closing together before lambdoid         20             18
Sagittal and lambdoid closing together before coronal         3              2
Coronal closing before sagittal and lambdoid                  18             14
Lambdoid closing before sagittal and coronal                  1              2

Totals                                                        117            70
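The frequencies in the table above are easier to compare between the sexes when expressed as proportions of each total. A minimal sketch follows; the category labels are abbreviated for illustration and only the five main categories, which sum to the totals, are used.

```python
# Frequencies of the five main orders of closure, from the table above.
orders = ["sagittal first", "sagittal+coronal first", "sagittal+lambdoid first",
          "coronal first", "lambdoid first"]
counts = {
    "male":   [75, 20, 3, 18, 1],
    "female": [34, 18, 2, 14, 2],
}

shares = {}
for sex, row in counts.items():
    total = sum(row)
    # Percentage of skulls of this sex showing each order of closure.
    shares[sex] = {o: round(100 * c / total, 1) for o, c in zip(orders, row)}
    print(sex, total, shares[sex])
```

On these figures the sagittal suture closes first in about 64 per cent. of the males and 49 per cent. of the females, while the coronal closes first in about 15 and 20 per cent. respectively.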


It is apparent from these figures that the sagittal shows a clear tendency to close 
before the other two sutures, while the coronal also tends to close before the 
lambdoid suture. The order of closure found with the greatest frequency in the case 
of both sexes is sagittal–coronal–lambdoid. Three other series have previously
been examined in the Biometric Laboratory in exactly the same way and it was 
found that for negroes (Teita) the coronal suture showed a slight tendency to close 
before the sagittal in the case of the males, and a definite tendency in the case of 
the females, though the numbers (37 males and 19 females) on which these 
estimates were based are small. Both the coronal and sagittal sutures showed a 
marked tendency to close before the lambdoid. In the Hythe and Spitalfields 
English series the sagittal suture was found almost invariably to be the first to 
close, or to begin closing, followed by the coronal and lambdoid sutures which 
began closing at approximately the same time. Thus the Kerma type occupies an 
intermediate position between the Teita, a negro series, on the one hand and 
the Hythe and Spitalfields European series on the other. For the negroes the 
coronal suture closes before the sagittal and well before the lambdoid; for the 
Nubians the coronal suture closes after the sagittal but before the lambdoid again, 
and for the Europeans the coronal suture closes after the sagittal but almost at 
the same time as the lambdoid. These are only average results, and the numbers 
on which they are based are unfortunately small, but in confirmation of Gratiolet’s 
Law they appear to indicate a definite racial difference in the order of sutural 
closing. The figures for the series examined in the Biometric Laboratory suggest 
that the tendency for the coronal suture to close before the lambdoid is rather 
more marked in the case of females than in the case of males for any particular 
racial series. As the numbers for the Kerma series are fairly adequate, it was 
considered worth while to examine whether the order of closure of the coronal and 
lambdoid sutures is associated with the magnitude of breadth measurements of 
the calvaria. The following table giving measurements for the two groups of male 
skulls from Kerma shows that the differences between the corresponding means 
are clearly insignificant : 


Order of suture-closing                    B′           B            Biasterionic B   L

Coronal suture closing before lambdoid     90.8 (72)    133.6 (67)   105.4 (57)       185.7 (67)
Lambdoid suture closing before coronal     93.8 (18)    134.8 (19)   105.7 (13)       185.7 (18)












Margot Collett 


265 


Complete cases of metopism were found in four male out of a possible 89 skulls, 
and six female out of a possible 97 skulls, only small traces of a suture above the 
nasion being found in other specimens. All the longer European series to be 
examined have shown a percentage of about 10 for metopic sutures and in the 
case of negro series the frequency is about 1 per cent., among the 122 Teita adult 
skulls only one metopic suture being found. The Kerma skulls, therefore, with a 
percentage of 4.5 for the males and 6.2 for the females, again occupy an intermediate 
position between the negro and white races. Contact between the frontal 
and parietal bones in the case of a metopic suture being present is indicated by 
LF+RP or RF+LP and the measurement given is the length of the common 
suture. The males affected are Nos. 107 (contact?), 111 (at bregma), 113 (at 
bregma), and 126 (? RF + LP); and the females Nos. 163 (LF + RP, 3.1 mm.), 
165 (RF + LP, 3.2 mm.), 166 (LF + RP, 8.2 mm.), 202 (LF + RP, 5.7 mm.), 219 
(LF + RP, 2.9 mm.) and 224 (LF + RP, 7.7 mm.). In accordance with what has 
been previously found, contact between the left frontal and right parietal bones 
is the most frequent occurrence. It has been stated that when the metopic 
suture persists to an adult stage it closes at about the same age as the sagittal. 
Five out of the ten metopic Kerma skulls have both frontal and sagittal sutures 
open ; in two other cases both are closed and in the remaining three the sagittal 
suture is open but the frontal is closed. One female specimen (No. 202) has 
the sagittal suture open but the frontal entirely closed and partly obliterated. 
Only when the squamous process made actual contact with the frontal bone 
were the cases of fronto-temporal articulation noted, these being in the males 
Nos. 62, 69, 136, 138 and 147 (on both sides), 70 and 124 (on the left but not on 
the right side), 3 and 51 (on the left while the right side was too defective to 
permit examination); and in the females Nos. 152, 194, 195, 199, 255 and 271 
(on both sides), and 160, 173, 203 and 230 (on the right but not on the left side). 
For the males there were 89 cases on the right side where the sutures at the 
pterion were sufficiently open to permit examination for cases of fronto-temporal 
articulation and of these five were affected giving a percentage of 5.6, and on the 
left nine were affected out of a possible 101 (8.9%); for the females, on the right side 
10 were affected out of a total of 83 (12.0%), and on the left six out of a possible 
92 (6.5%). These percentages are unusually high, as for European series in general 
the frequency is of the order of 1.6 per cent., but for negroes an average of 12 per 
cent. has been found*. The Kerma series it will be noticed once again occupies 
an intermediate position. No complete case of a horizontal suture across the malar 
bone occurs in the series, two female skulls however (Nos. 197 and 218) show 
slight traces of it. No traces of sutures between the ex- and supra-occipitals were 
observed. 
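
The percentages of fronto-temporal articulation quoted above follow directly from the counts of affected sides. A minimal Python sketch of that arithmetic is given here, using only the counts stated in the text; the dictionary layout and the function name `percentage` are illustrative choices and are not part of the original memoir.

```python
# Counts of fronto-temporal articulation quoted in the text:
# (sides affected, sides available for examination).
COUNTS = {
    ("male", "right"): (5, 89),
    ("male", "left"): (9, 101),
    ("female", "right"): (10, 83),
    ("female", "left"): (6, 92),
}


def percentage(affected: int, possible: int) -> float:
    """Return the frequency as a percentage rounded to one decimal place."""
    return round(100.0 * affected / possible, 1)


# Percentage for each sex and side, keyed like COUNTS.
RATES = {key: percentage(*pair) for key, pair in COUNTS.items()}

if __name__ == "__main__":
    for (sex, side), rate in RATES.items():
        print(f"{sex:6s} {side:5s} {rate:4.1f} per cent.")
```

The four values obtained in this way can be set beside the figures printed in the paragraph above as a check on the transcription of the counts.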

(b) Supernumerary Bones. In the male series four cases of true interparietal 
bones were found out of a possible 111 skulls (3.6%) and for the females only one 
out of a possible 104 cases (0.9%) was noted, thus giving a percentage of 2.3 for 

* Cf. le Double, Traité des Variations des Os du Crâne de l'Homme (1903), pp. 809–308. 


the whole series. Owing to the irregularity of the percentages found for both 
European and negro series in this respect, it is not known whether any racial 
significance can be attached to the Kerma figures. Of the male specimens, No. 86 
shows the os pentagonale and right os triangulare separate but with no suture 
between them; the other three skulls (Nos. 98, 121 and 130) have the os pentagonale 
only separate. The female specimen (No. 198) also has the os pentagonale 
only separate. An os epactal* was noted in one male skull (No. 41): it is divided 
by a vertical suture which is to the right of the sagittal suture. There are no 
examples of an ossicle of bregma and only small Wormian bones were found in the 
coronal and sagittal sutures. The Wormian bones in the lambdoid suture appeared 
to be smaller and less frequent than in European series. Two unusual anomalies 
were noted; a male specimen (No. 107) has a large Wormian bone between the 
temporal squama and parietal on the left side and a female specimen (No. 153) 
has two large and two small Wormian bones in the same position on the left while 
the right side is normal (see Plate IV b ). Several cases of an ossicle of lambda 
were observed. Records were made of the epipteric bones found, only those with 
a maximum diameter greater than 3 mm. being counted. In compiling the total 
numbers which might have possessed these supernumerary bones all those skulls 
were excluded on which the superior border of the greater wing of the sphenoid 
was indefinite on the side considered owing to synostosis. More males than females 
were excluded on this account. The following frequencies were found for the group 
having the region of the pterion intact on both sides and with the sutures there 
visible : 


                                                                 ♂      ♀

No epipteric bones                                               40     43
Epipteric bones (one or more) on both sides                       4      9
One or more epipteric bones on the right but none on the left     4      7
One or more epipteric bones on the left but none on the right     5      9
Totals                                                           53     68


A considerable difference is shown between the percentages for the two sexes, the 
males having 13 skulls with one or more epipteric bones out of a possible 53 cases 
(24.5%), and among the females there are 25 cases out of a possible 68 specimens 
(36.8%). According to le Double† it is not known for which sex the anomaly 
occurs with the greater frequency and comparison with the Hythe and Teita series 
does not encourage the assumption that there are any sexual or racial differences. 
The percentage for the Teita male series is 22.2 (or 8 out of a possible 36 cases) 
and for the female 19.5 (or 8 out of a possible 41 cases). For the Hythe skulls 
the male percentage is 32.0 and the female 25.8. 
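
Whether the Kerma sexual difference in epipteric bones is larger than chance would give can be gauged roughly from the counts themselves. The sketch below applies a pooled two-proportion z statistic, which is a modern convenience assumed here for illustration and is not the method of the memoir; the function name `two_proportion_z` is likewise hypothetical.

```python
import math


def two_proportion_z(x1: int, n1: int, x2: int, n2: int) -> float:
    """z statistic for the difference p2 - p1, using the pooled standard error."""
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1.0 - pooled) * (1.0 / n1 + 1.0 / n2))
    return (p2 - p1) / se


# Kerma counts from the text: skulls with one or more epipteric bones,
# out of those with the pterion intact on both sides.
MALE = (13, 53)
FEMALE = (25, 68)

Z_KERMA = two_proportion_z(*MALE, *FEMALE)

if __name__ == "__main__":
    print(f"male   {100 * MALE[0] / MALE[1]:.1f} per cent.")
    print(f"female {100 * FEMALE[0] / FEMALE[1]:.1f} per cent.")
    print(f"z = {Z_KERMA:.2f}")
```

On these counts the statistic comes out at roughly 1.4, which on the usual normal criterion would not be read as a clear difference; this is in keeping with the caution expressed in the text about inferring sexual differences from such figures.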

(c) Teeth. The state of preservation in which the teeth of the Kerma skulls 
were found was remarkably good, and there was a very high percentage of palates 

* Cf. le Double, Traité des Variations des Os du Crâne de l'Homme (1903), p. 60. 

† Op. cit. p. 806. 







with no teeth lost before death. There are 111 complete male palates of which 71 
have no teeth lost before death (64.0%), and for the females the percentage is 
69.5, 57 skulls from a total of 82 complete palates not having lost any teeth before 
death. Excluding a few cases, where it could not be determined whether the third 
molars had erupted or not owing to the loss of one or more molars before death, 
an investigation with regard to the presence or absence of third molars was made 
with the following results : 



                                                                    ♂       ♀

Region of third molars complete and undeformed on both sides        96      73
No third molar on either side                                        9       2
Third molar on right but not on left side                            3       0
Third molar on left but not on right side                            1       5
Percentage having one or both third molars missing                13.6     9.6


Considering the sides separately, in which case a few additional specimens with 
the region of the third molar complete on one side but defective on the other can 
be included, it is found for the males that on the right side there is an absence of 
the third molar in 12 cases out of a possible 102 (11.8%) and on the left there are 
12 showing an absence out of a possible 99 (12.1%). For the females there is 
absence in seven cases out of a possible 79 (8.9%) on the right and four out of a 
possible 82 (4.9%) on the left. Several dental anomalies were observed among 
the male skulls : No. 79 has only a single premolar on the right, No. 92 (Plate VI a) 
apparently lacked two teeth other than molars, No. 123 has no canine on the right, 
while No. 128 lacks a canine on the left side. The second right incisor in skull 
No. 101 has erupted behind its normal position. A curious condition is present in 
No. 107 (Plate VI d ) which apparently had only three incisors of which one was 
placed centrally so that it appears to have erupted between the premaxillary bones. 
The socket for the third incisor present on the left in skull No. 4 is the only case 
of an extra tooth among the male specimens. The female specimen No. 155 
(Plate VI c) has no canine present on the left side and no third molar on the right. 
An absence of lateral incisors was noted in skull No. 216 (Plate VI b), and No. 266 
has only one incisor on the left and no canine or third molar on the right. Several 
examples were observed of crowding of the teeth and of deformations of the alveolar 
margin due to disease (see Plate VI a showing an opening in the region of the first 
and second left molars). 

(d) Other Anomalies. The relative sizes of the jugular foramina were compared 
with the following results : 


Sex      JR            JL

♂        74 (66%)      20 (17%)
♀        48 (62%)      20 (26%)
















These results are quite in accordance with those for all series previously examined 
in this way. Precondyles, both single median and double, were noted in several 
cases, a female skull (No. 194) with a single median precondyle having the largest. 
There are no marked cases of a fossa pharyngea. A female specimen (No. 180) 
has a large para-mastoid process on the left side with an articular surface, and one 
case of a basi-occipital incisure was observed on the left side of another female 
skull (No. 193). Plate V b illustrates a curious condition present in a male skull 
(No. 36) ; this is a constriction of the inferior part of the basi-occipital where it 
unites with the sphenoid. The numbers of tympanic perforations were recorded 
but no details are given as no racial or sexual differences, or evidence that they 
were more numerous on one side than on the other, could be deduced. What appears 
to be an infantile condition of some interest is present in a female skull (No. 216 : 
Plate V d) which shows a complete failure to unite on the superior and inferior 
margins of the tympanic plate on the left side and a union only just made on the 
right*. None but very small exostoses were found. Three healed wounds were 
observed among the female skulls from a total of 114, but, as is to be expected, 
the wounds on the male skulls were more numerous, totalling 16 for the 141 skulls. 
Those of greatest interest are on male specimens and photographs of two of these 
are given: No. 74 (Plate IV a) has a depressed wound on the right parietal and 
temporal squama and a healed fracture of the left malar bone ; No. 64 (Plate V c) 
has what appears to be a wound on the right maxillary bone below the orbit. 
No. 7 shows a fracture and subsequent rejoining of the left zygomatic arch. One 
case of a wound on the nasal bones is noted in skull No. 66. Traces of roughening 
of the glenoid fossa due to arthritis were found on several skulls of which the males 
Nos. 46, 119 and 146, and the females Nos. 151 and 236 were the most marked. 
The only other sign of disease was observed on the female skull No. 274 (Plate V a); 
the diseased parts are two symmetrically placed areas on either side of the lambda, 
the lambdoid suture being prematurely closed where it lies within the diseased 
areas. This may be a case of periostitis. Holes similar to those found by Prof. 
Elliot Smith on Nubian skulls† and supposed due to insects were evident on 
several of the Kerma skulls (see Plate III b below). The holes are mostly small 
and not nearly so numerous as on the skull he figures. No signs of healing are 
shown on our specimens and the excavations were almost certainly made after death. 


(4) Comparisons between the Kerma and other Racial Series. In order to gain 
a knowledge of the racial relationships of the Kerma series, the Coefficients of 
Racial Likeness were calculated between its means and those available for a number 
of other series. With the usual notation, the form of the crude coefficient used is 


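$$\frac{1}{M}\sum\left[\frac{n_s\,n'_s}{n_s+n'_s}\left(\frac{m_s-m'_s}{\sigma_s}\right)^{2}\right]-1\;\pm\;0.67449\sqrt{\frac{2}{M}}\;=\;\frac{1}{M}\sum(\alpha)-1\;\pm\;0.67449\sqrt{\frac{2}{M}}.$$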


* The adolescent Sinanthropus skull shows a similar fissure of the tympanic plate, and the condition 
is normal for infants to-day. 

† The Archaeological Survey of Nubia, Report for 1907–1908, Cairo (1910), Vol. II, p. 290, and 
Plates Accompanying Vol. II, Plate xxx, Fig. 2. See also “The alleged Discovery of Syphilis in Prehistoric 
Egyptians,” The Lancet, August 22 (1908), p. 521. 





The reduced coefficient is defined to be 

$$50\times\frac{\bar{n}_s+\bar{n}'_s}{\bar{n}_s\times\bar{n}'_s}\left[\frac{1}{M}\sum(\alpha)-1\;\pm\;0.67449\sqrt{\frac{2}{M}}\right],$$

where $\bar{n}_s$ and $\bar{n}'_s$ are the mean numbers of skulls available for the characters used 
in computing the coefficient for the first and second series in the comparison, 
respectively. The reduced coefficient is supposed to give the best measure of the 
absolute divergencies between the types which it is possible to find at present. 
The standard deviations of the long Egyptian E series of Twenty-sixth to Thirtieth 
Dynasty skulls were used in computing the coefficients. 
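
The calculation of the crude and reduced coefficients can be sketched in Python as follows. This is only an illustration under stated assumptions: the class `Character`, the function names and the two example characters are hypothetical and are not taken from Table III, and the probable error of the reduced coefficient is assumed to be scaled by the same factor as the coefficient itself.

```python
import math
from dataclasses import dataclass


@dataclass
class Character:
    """One cranial character compared between two series."""
    n1: int        # skulls measured in the first series
    n2: int        # skulls measured in the second series
    mean1: float   # mean in the first series
    mean2: float   # mean in the second series
    sigma: float   # standard deviation from a long reference series


def alpha(c: Character) -> float:
    """alpha = n n' / (n + n') * ((m - m') / sigma)^2 for one character."""
    return c.n1 * c.n2 / (c.n1 + c.n2) * ((c.mean1 - c.mean2) / c.sigma) ** 2


def crude_crl(chars):
    """Return (crude coefficient, probable error) over M characters."""
    m = len(chars)
    value = sum(alpha(c) for c in chars) / m - 1.0
    return value, 0.67449 * math.sqrt(2.0 / m)


def reduced_crl(chars):
    """Scale the crude coefficient and its probable error by
    50 (n + n') / (n n'), n and n' being mean numbers of skulls per character."""
    m = len(chars)
    nbar1 = sum(c.n1 for c in chars) / m
    nbar2 = sum(c.n2 for c in chars) / m
    factor = 50.0 * (nbar1 + nbar2) / (nbar1 * nbar2)
    value, pe = crude_crl(chars)
    return factor * value, factor * pe


# Hypothetical figures for illustration only (not taken from Table III).
EXAMPLE = [
    Character(100, 50, 185.0, 183.0, 6.0),
    Character(100, 50, 134.0, 136.0, 5.0),
]

if __name__ == "__main__":
    print("crude   %.2f ± %.2f" % crude_crl(EXAMPLE))
    print("reduced %.2f ± %.2f" % reduced_crl(EXAMPLE))
```

Because the factor depends only on the mean numbers of skulls, the reduced value removes most of the dependence of the crude coefficient on the sizes of the two samples, which is the purpose stated in the text.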

The male and female Kerma series are of approximately equal length, but this 
is not the case for the majority of cranial series for which measurements have been 
provided. In general the male series are longer than the corresponding female 
ones and there are far more of them available. Our estimates of racial affinity 
are thus based principally on the comparisons of the male series. There are 20 of 
these series of sufficient length representing different sections of the population of 
Egypt from Early Predynastic to Roman times. Of the sites represented Gizeh and 
Deshasheh with Medum are in Lower Egypt, but all the others are in Middle 
or Upper Egypt. Table III gives all the coefficients of racial likeness between 
these Egyptian and the Kerma series and the reduced male values range from 
1.11 to 19.56. Working on data provided by Morant, Woo has given* the 
lowest reduced coefficients of racial likeness found for each of 23 male Egyptian 
series, and they range approximately from −0.5 to +4.5. Three of the values now 
found with the Kerma series are less than 4.5 and two are as low as any which it 
has generally been possible to find for these types. We are thus led to conclude 
that the population of Kerma in the Twelfth and Thirteenth Dynasties was of 
typical Egyptian type. It has been shown by Morant† that in Upper Egypt the 
character of the population was changing slowly from Early Predynastic to Roman 
times, the type having gradually lost its original negroid characters and at the 
same time increased its calvarial breadth and hence its cephalic index. This 
general tendency was illustrated by the majority of the series available, and the 
exceptions to it which were found could be supposed due to the peculiarities of 
local types or possibly to a local and restricted admixture of part of the population 
with alien immigrants. The question to what period the Upper Egyptian series 
which most closely resemble the Kerma belong is one of particular interest. Its 
closest connections are found with the Naqada series which is probably of Middle 
Predynastic date and with a Late Predynastic series measured by Thomson and 
MacIver‡. A First Dynasty series is next in order and then the Kerma type is 

* Biometrika, Vol. xxii (1930), p. 76. 
† Ibid. Vol. xvii (1926), pp. 1–62. 

‡ For the series measured by Thomson and MacIver only 14 of the complement of 31 characters 
used in computing the coefficient of racial likeness are given. For these 14 characters the male crude 
coefficient between the Kerma and Naqada A and Q series is 1.21 ± .26 and the reduced value is 1.49 ± .31. 
These only differ slightly from the values in Table III given for the total 31 characters, and it may be 
assumed that the 14 characters provided by Thomson and MacIver lead to close approximations to the 
coefficients which would be obtained if all 31 characters could be used in comparisons with their series. 
It may be concluded that the Kerma series is equally related to the Naqada and to the Late Predynastic 
series of Thomson and MacIver. 



TABLE III. 

Coefficients of Racial Likeness between the Kerma (12th and 13th Dynasties) and other Series. 

[The numerical columns (mean numbers of skulls for the Kerma and for the compared series, and the crude and reduced coefficients with their probable errors, for ♂ and ♀) are illegible in this copy. The series and their periods follow in the order of the table.] 

Series                           Period

Naqada (A and Q series)          Predynastic
El-Amrah and Hou                 Late Predynastic
Abydos, El-Amrah and Hou         1st Dyn. (Private Tombs)
Abyssinia (Tigre District)       Modern
Abydos, El-Amrah and Hou         Early Predynastic
Hou and Abydos                   12th–15th Dyns.
Denderah                         6th–12th Dyns.
El-Kubanieh North                Middle Dyns.
Denderah                         Ptolemaic
Shekh Ali                        18th Dyn.
El-Kubanieh South                Early and Middle Dyns.
Badari                           Early Predynastic
Thebes                           18th–21st Dyns.
Galla and Somali                 Modern
Thebes                           18th–20th Dyns.
Abydos                           18th Dyn.
Negroes from Egypt               Modern
Denderah                         Roman
Gizeh                            26th–30th Dyns.
Abydos                           1st Dyn.
Teita (Kenya Colony)             Modern
Sedment                          9th Dyn.
Tamils                           Modern
Nepalese                         Modern
Abydos                           18th and 19th Dyns.
Deshasheh and Medum              4th and 5th Dyns.
Dravidians                       Modern
Tanganyika                       Modern
Hottentots                       Modern
Hindus                           Modern
Veddahs                          Modern

Measurers named in the table: Fawcett, Thomson and MacIver, S. Sergi, Toldt, Stoessiger, E. Schmidt, Stahr, Pearson and Davin, Motley, Kitson, Woo, Harrower, Morant, Turner and Reid; some series are pooled. 






found to be equally removed from those of Early Predynastic and Early Dynastic 
populations. It is also approximately equally, but more distantly, removed from the 
earliest Predynastic type (Badari) and from several found in Upper Egypt about 
the Eighteenth Dynasty. The relationships of the Kerma to most of the Later 
Dynastic series, including the Roman, are more distant still. Exceptions to any 
orderly arrangement suggested by these facts must, of course, be expected. They 
are found, for example, in the rather close connection between the Kerma series 
and the Ptolemaic and in the dissimilarity of the former to the First Dynasty 
series from Abydos measured by Motley, to the Ninth Dynasty series from Sedment 
and to the Fourth and Fifth Dynasty series from Deshasheh and Medum, though 
it must be remembered that the last series comes from Lower Egypt. In these 
cases the other series, and not the Kerma, must be supposed peculiar, as has been 
pointed out in earlier papers. The evidence provides abundant justification for 
considering that our series represents the typical population found in Upper Egypt 
in Late Predynastic times, and it can only just be distinguished from other series 
representing that population. But actually our series is of Twelfth and Thirteenth 
Dynasty date, though it is clearly differentiated from the contemporary population 
of Upper Egypt. Obvious conclusions are suggested by these facts. It may be 
supposed that the colony at Kerma was founded in Late Predynastic times by a 
body of emigrants from Upper Egypt who were racially typical of the population 
there and that this type persisted at Kerma unchanged by admixture with any non- 
Egyptian elements, or by the factors which were modifying the parent population, 
until the Twelfth and Thirteenth Dynasties. It appears to be extremely probable 
that these hypotheses are the correct ones, but they could only be submitted to 
direct proof if we had skeletal evidence of the racial constitution of the population 
at Kerma between Early Predynastic and Twelfth Dynasty times. 

It is certainly unlikely that any coefficients of racial likeness of the same order 
as the lowest in Table III could be found between the Kerma and any non-Egyptian 
series. There is a close connection shown with Abyssinians from the Tigre district, 
but all the closest connections as yet found for this series have been with Dynastic 
Egyptian types and it is supposed to represent a survival until modern times of 
part of that population. It has been shown by Stoessiger* that there is a moderately 
close resemblance between the Predynastic Egyptian series and modern Indian ones, 
and by Kitson† that the former also resemble various negroid and Bantu negro 
East African types. It is among these two groups of races that we should expect 
to find those outside Egypt which will bear the closest resemblance to the Kerma 
type. Male coefficients of racial likeness were therefore calculated between the 
Kerma and all the best series available for the two groups and these are given in 
Table III. Of all the alien types compared, the Galla and Somali shows the lowest 
reduced coefficient, though it is higher than eleven of the values found with the 
Egyptian (including the Abyssinian) series. A moderately close connection with 
that “Hamitic” series was to be expected. Neither is it surprising to find that the 


* Biometrika, Vol. xix (1927), pp. 125–136. 

† Ibid. Vol. xxiii (1931), pp. 285–300. 




next closest connection is with a modern series from Egypt of which the origin is 
unknown and which is only judged to be one of negroes on account of the cranial 
measurements. The Teita from Kenya Colony come next in order and they are 
undoubtedly Bantu negroes, while the Tamils are a little further removed. The 
resemblances in these cases, and those between the Kerma and the other East 
African and Indian types, are very much more distant than the majority of those 
between the Kerma and the Egyptian types. The fact that a negro series shows 
a lower reduced coefficient than three Egyptian series may be attributed to the 
fact that the latter are not of pure Egyptian origin. It has been shown by Woo, 
for example, that his Sedment series and that of the Fourth and Fifth Dynasties 
from Deshasheh and Medum are more closely related to modern Egyptians and 
modern Cretans than are the other Dynastic Egyptian series. There is again no 
suggestion that the Kerma population was of any but pure Egyptian origin. 

From the 31 series for which male coefficients of racial likeness were calculated, 
eight were selected for the purpose of comparing the female means with the Kerma 
values. There are no adequate female means available for most of the other series. 
If the male and female means for each of two series compared really represent the 
same population, then we should expect to find that the reduced coefficients for 
the two sexes do not differ significantly. This is actually the case in five out of the 
eight comparisons made in Table III, the differences for these being all less than 
2.5 times their probable errors. For the other three (viz. the Late Predynastic 
series from El-Amrah and Hou, the Twelfth to Fifteenth Dynasty from Hou and 
Abydos, and the modern series of negroes from Egypt) the sexual differences 
between the coefficients are markedly significant. The means for the last series 
are based on such small numbers that no stress need be placed on the discordance. 
In the case of each of the other two a direct comparison of the male and female 
means suggests that the samples may not represent exactly the same population. 
It has been shown in section (2) above that the Kerma male and female indices and 
angles only differ clearly in an unexpected way in the case of the nasal indices and 
of the angles of the fundamental triangle, the female type having the higher index 
and the greater nasal angle, indicating a greater degree of prognathism. This might 
suggest that the female Kerma sample was more negroid than the male, but the 
coefficients with the Teita and Tanganyika negro series fail entirely to substantiate 
this view. The Kerma sexual differences for these characters, though of an order 
which must be considered significant, may only have been due to chance causes. 
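
The comparison of the male and female reduced coefficients described above can be sketched as follows. The rule shown takes a difference as significant when it exceeds 2.5 times its probable error, and it assumes that the two coefficients are independent, so that the probable error of the difference is the root of the sum of squares of the two probable errors. The function name and the numerical values in the usage lines are hypothetical and are not taken from Table III.

```python
import math


def differs_significantly(c_male, pe_male, c_female, pe_female, k=2.5):
    """True when |difference| exceeds k times the probable error of the difference.

    Assumes the two coefficients are independent, so that their probable
    errors combine as the root of the sum of squares.
    """
    pe_diff = math.sqrt(pe_male ** 2 + pe_female ** 2)
    return abs(c_male - c_female) > k * pe_diff


if __name__ == "__main__":
    # Hypothetical coefficients and probable errors, for illustration only.
    print(differs_significantly(1.5, 0.3, 2.0, 0.4))  # difference 0.5 against 2.5 x 0.5
    print(differs_significantly(1.5, 0.3, 4.0, 0.4))  # difference 2.5 against 2.5 x 0.5
```

If the male and female samples of a series were not independent the probable error of the difference would need a covariance term, which this sketch leaves out.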

A detailed comparison of measurements considered singly need not be made. 
If only the 31 characters used in computing the coefficients of racial likeness are 
considered, the α's found show whether the differences between the means compared 
should be considered significant or not. We will suppose, as usual, that a significant 
difference is indicated if the α is greater than ten. The proportions of significant 
to non-significant α's are found to differ markedly for different characters as has 
been observed in all previous comparisons of this kind. The coefficients (given in 
Table III) were computed between the Kerma male series, on the one hand, and 





21 series of the ancient Egyptian type (including the modern Abyssinians), on the 
other. There are eleven comparisons between median sagittal arcs (S) from nasion 
to opisthion (the measurement not being available for the other ten series) and 
not a single one of these is significant. For the basio-bregmatic height (H′) only 
one significant difference out of 21 is found, and for the glabella-occipital length 
there are only three α's greater than ten in 21 comparisons. Such characters will 
clearly be of no value if a classification of the material on the basis of single 
measurements, or groups of a few measurements, is attempted. The only measurements 
which are likely to be of any value for such purposes are those for which 
more than 30 per cent., say, of the possible comparisons show significant differences. 
In the present case these are: 100 B/L (61.9%), 100 NB/NH (52.4%), B (42.9%), 
100 G2/G1 (42.9%), NH (38.1%), B′ (36.4%) and Q′ (33.3%). The percentage 
for 100 G2/G1 is only based on a total of seven comparisons and no stress can be 
laid on it, while the other characters selected in this way really only indicate that 
two primary factors are concerned. The differences between the maximum calvarial 
breadths (B) are necessarily associated with those between the transverse arcs (Q′) 
and cephalic indices (the calvarial length and height remaining practically constant) 
and with those between the minimum frontal diameter (B′); while the 
differences between the nasal heights are necessarily associated with differences 
between the nasal index (the nasal breadth remaining practically constant). In 
comparing the Egyptian series Morant found that the essential differences were, 
in general, those between the calvarial breadths and the characters dependent on 
that character, and also those between the nasal indices. The Kerma series thus 
differs most markedly from the Egyptian series in precisely the same characters 
as they, on the whole, differ most markedly from one another, or, in other words, 
the Kerma type must be considered to be a perfectly typical representative of the 
Egyptian stock, as we have previously concluded from a comparison of the coefficients 
of racial likeness. When comparison is made between the Kerma and 
the non-Egyptian series the incidence of characters which differ most essentially 
alters as we pass from series to series and a detailed description of these differences 
would not be profitable. 
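
The share of significant comparisons for a character is a simple proportion of the α's exceeding ten. A short Python sketch is given here for the three characters whose counts are stated explicitly in the text; the dictionary keys and the function name `share_significant` are illustrative only.

```python
# (comparisons with alpha greater than ten, total comparisons), as stated
# in the text for three characters of the Kerma male series.
TALLY = {
    "median sagittal arc": (0, 11),
    "basio-bregmatic height": (1, 21),
    "glabella-occipital length": (3, 21),
}


def share_significant(significant: int, total: int) -> float:
    """Percentage of comparisons in which alpha is greater than ten."""
    return 100.0 * significant / total


# Percentage of significant comparisons for each character.
SHARES = {name: share_significant(*pair) for name, pair in TALLY.items()}

if __name__ == "__main__":
    for name, share in SHARES.items():
        print(f"{name:26s} {share:5.1f} per cent.")
```

All three shares fall well below those quoted in the text for the breadth and nasal characters, which is why such measurements are set aside for purposes of classification.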

(5) Type Contours. The individual contours were drawn and the type contours 
were constructed from them by following the usual methods employed in the Bio- 
metric Laboratory. The mean measurements are given in Tables IV — VI and the 
male and female types are shown in Figs. II — VII. Comparison of the type contour 
with the mean calliper measurements in Table VII shows a very satisfactory agreement. 
The differences are all less than 1 mm. except in the case of the male and 
female S1′ (the chord from nasion to bregma) and the female GL. The difference 
of 1.4 mm. in the last case is chiefly due to the fact that the two means are based 
on very different numbers and when found for the same 33 skulls it is reduced to 
0.6. It is probable that the rather large differences found in the case of S1′ were 
caused by the fact that the nasion was not located in precisely the same position 
when drawing the contours as when taking the calliper length. Table VII shows 
clearly that the practice of raising or lowering the pointer of the tracer in order 



TABLE IV. 

Mean Measurements of Kerma Transverse Contours. 

TABLE V. 

Mean Measurements of Kerma Horizontal Contours. 

TABLE VI. 

Mean Measurements of Kerma Median Sagittal Contours. 

The occipital points are above the Nγ lines. 


TABLE VII. 

A Comparison of Calliper and Type Contour Measurements*. 

                      Male                          Female 
Character    Contour        Calliper        Contour        Calliper 

L            186.2 (98)     186.2 (138)     176.7 (73)     177.2 (114) 
OH           110.7 (124)    110.7 (125)     107.7 (90)     107.4 (91) 
S'1          113.5 (98)     112.1 (126)†    108.7 (73)     107.4 (109)‡ 
S'2          114.9 (98)     115.1 (128)     110.4 (73)     110.4 (106) 
S'3           97.4 (92)      97.2 (124)      94.7 (69)      94.3 (97) 
fml           38.6 (84)      36.4 (112)      34.7 (64)      34.6 (92) 
H'           134.3 (84)     133.4 (93)      128.1 (64)     127.8 (88) 
G'H           69.6 (50)      69.6 (111)      66.4 (37)      65.7 (82) 
GL            96.2 (46)      96.5 (97)       95.1 (33)      93.7 (88)§ 
LB           102.0 (84)     101.8 (117)      96.6 (64)      96.6 (89) 

* The contour OH is the height (MA) of the transverse type contour, and all other contour measurements are taken from the sagittal type. The maximum length (L) is measured on that figure; G'H, GL and LB are used in its construction and all other measurements compared are found by calculation from lengths used in the construction of the type. 

† For the 98 male skulls for which sagittal contours were drawn the mean calliper S'1 is 112.1. 

‡ For the 73 female skulls for which sagittal contours were drawn the mean calliper S'1 is 107.8. 

§ For the 33 female skulls for which the contour GL is given the mean calliper GL is 94.6. 


to pass exactly through the “points” in drawing the sagittal figures does not 
introduce any serious discrepancies. For all practical purposes there is little danger 
in assuming that all the points shown on that section really lie in a single plane 
in the case of each individual skull*. If the series be a long one it might be more 
satisfactory to omit the specimens for which this is least true. 

The male transverse type is almost exactly symmetrical, all differences between 
the right and left sides of the same parallel being less than 1 mm. Greater 
differences are found in the case of the female figure and the largest is 2.5 for the 
ninth parallel. The Kerma sections show no striking peculiarities. A number of 
indices, providing measurements of shape, have been derived from these figures for 
the purpose of making racial comparisons more exact, and for all these the Kerma 
values fall well within the ranges given by racial series previously studied. The 
male and female indices are closely similar as is generally found. By superposition, 
with the aid of the tracings provided, the Kerma outlines are found to be extraordinarily close to those provided by the Teita negro series and rather further removed 
from the Badari types. 

The horizontal type contours are very approximately symmetrical, the maximum 
difference between the right and left sides of the same parallel being 1.5 in the 
case of the male figure and 1.7 for the female. They are again found to have no 

* [I do not agree with this view. The “practice” does not seem to me satisfactory, as the sagittal 
section of the individual skull ceases to be a section of that skull at all, and the diagraph thus used 
can lose the advantages it has over the dioptograph. Ed.] 











unusual characters and the indices which have been used for comparative purposes 
are all, with one exception, within the ranges previously found. The most striking 
sexual difference in the case of the horizontal contours is seen in the shapes of the 
sections of the temporal fossae. Immediately behind the temporal lines (marked 
by the points TR and TL) the male outline is contracted to a greater extent than 
the female. This difference has been measured by expressing the total length of 
parallel 3 as a percentage of the width between the temporal lines, i.e. very 
approximately TR(y) + TL(y). The male Kerma index is 97.7 and the female 
108.1, and this difference is rather greater than any previously found between the 
male and female values for the same racial series. The male index is also lower, 
indicating more marked temporal fossae, than any previously found. The Kerma 
type contours are again found to be remarkably similar to the Teita, the only 
difference which appears to have any significance being dependent on the fact that 





the sections of the temporal fossae are more accentuated on the Kerma male figure. 
The Kerma and Badari types differ more markedly. 

[Fig. III. Transverse Type Contour, based on 90 ♀ Kerma Skulls.] 

The sagittal type sections possess no features which would have been unexpected 
in contours representing an Egyptian series, except, perhaps, a rather greater degree 
of prognathism. All the indices and angles which have been used to compare the 
most salient racial differences of the median sagittal section have values for the 
Kerma series which fall well within the ranges given by the types previously 
recorded. Mean sexual differences are generally found for some of these and those 
of the same sign and the same order are observed in the present series. For 
example, the index expressing the maximum subtense of the frontal arc as a 
percentage of the Nβ chord is 23.2 for the males and 24.1 for the females, and the 
fact that the female frontal bone is, on the average, more vertical than the male 
can be illustrated by angular measurements. A comparison of the mean indices 





and angles derived from calliper measurements for the two series showed that the 
most marked differences in shape which would not have been anticipated were 
found for the angles of the fundamental triangle. Confirmation of the correctness 
of the means can be obtained from the sagittal type contours. The values are : 


Sex   Method of Measurement      N∠            A∠            B∠ 

♂     By callipers on skull      64°.9 (96)    74°.1 (95)    41°.0 (96) 
      Scaled on type contour     64°.9 (46)    73°.4 (46)    41°.7 (46) 

♀     By callipers on skull      67°.6 (63)    72°.1 (63)    40°.3 (63) 
      Scaled on type contour     68°.3 (33)    71°.2 (33)    40°.5 (33) 


By superposing the types it is found that the Kerma sagittal section bears 
a much closer resemblance to the Badari than to the Teita. Strangely enough, 
considering that one of these series represents a tribe of Bantu negroes, the degrees 
of prognathism of these three are almost identical, but the section of the nasal 
bones projects considerably less in the Teita than in the Egyptian types. 

(6) Conclusions. The series of skulls described in this paper represents the 
population of the Egyptian settlement at Kerma (Nubia) in the Twelfth and 
Thirteenth Dynasties. Only adults were dealt with and of these 141 appeared to 
be males and 114 females. No distinction can be made between the series repre- 
senting the Twelfth and Thirteenth Dynasties respectively. The male and female 
series may be supposed to represent the same racial type. The male variabilities 
are rather greater than the female, as is generally found, and the population as 
a whole must be considered rather more variable than those in some Dynastic 
Egyptian cemeteries, but still as homogeneous as most for which cranial samples 
are available. The type is distinctly negroid and the frequencies with which some 
anomalies are found assign it to an intermediate position between those of Bantu 
negroes on the one hand and European types on the other. The coefficients of 
racial likeness show that all the closest relationships of the Kerma series are with 
Upper Egyptian series of Predynastic and Early Dynastic date. The closest con- 
nections found are with Late Predynastic types and the Kerma sample may be 
supposed to represent the typical population of Upper Egypt at that time. It is 
distinctly further removed from the types contemporary with it found in Upper 
Egypt. Hence it is concluded that the settlement was probably founded in Late 
Predynastic times and that the racial type there persisted unchanged until the 
Thirteenth Dynasty although the parent population had been modified in the 
interval. Comparisons are also made with negro and Indian series, but no unex- 
pectedly close connections are shown. The type contours are provided. 

In conclusion I must thank Mr G. C. Dunning, some of whose measurements 
I have used, Miss M. Kirby, who drew the map and type contours, and Dr G. M. 
Morant, to whom I am greatly indebted for much help and encouragement. 



DESCRIPTION OF PLATES OF KERMA SKULLS. 

I. Typical male skull (No. 27), Norma facialis (0.9 natural size). 

II. Typical male skull (No. 27), Norma lateralis (0.6 natural size). 

III. (a) Typical male skull (No. 27), Norma verticalis (0.7 natural size). This specimen has a cephalic index of 72.0, and the male mean for the series is 72.2. 

(b) Male skull (No. 180; 0.7 natural size). This shows holes presumed to have been made by insects after death. 

IV. (a) Male skull (No. 74; 0.7 natural size), with a depressed wound on the parietal and temporal bones and a healed fracture of the malar bone. 

(b) Female skull (No. 158; 1.2 natural size), with large Wormian bones between the temporal squama and parietal bone. 

V. (a) Female skull (No. 274; 0.7 natural size), showing diseased areas on either side of the lambda. 

(b) Male skull (No. 86; 1.4 natural size), showing constriction of the basi-occipital where it unites with the sphenoid. 

(c) Male skull (No. 64; 0.8 natural size), showing a wound below the right orbit. 

(d) Female skull (No. 216; 1.3 natural size), showing failure of the tympanic elements to unite. 

VI. (a) Male skull (No. 92; 1.2 natural size), with the sockets of two teeth other than molars lacking and a diseased opening on the left-hand side. 

(b) Female skull (No. 216; 1.2 natural size), with no sockets for lateral incisors. 

(c) Female skull (No. 155; 1.8 natural size), with no sockets for the left canine and right third molar. 

(d) Male skull (No. 107; 1.5 natural size), with sockets for three incisors only, the socket for the central incisor being apparently between the pre-maxillary bones. 





Biometrika, Vol. XXV, Parts III and IV 

Collett: Skulls from Kerma (Nubia) 

Plate I 

Typical Male Kerma Skull (No. 27). Norma facialis. 

Plate II 

A Typical Male Kerma Skull (No. 27). Norma lateralis. 

Plate III 

(b) Male skull showing holes presumed to have been made by insects after death. 

Male Kerma Skulls. Normae verticales. 

Plate IV 

(b) Female skull (No. 153), with large Wormian bones between the temporal squama and parietal bone. 

Anomalous Kerma Skulls. 

Plate V 

(c) Male skull (No. 64), showing a wound below the orbit. 

(d) Female skull (No. 216), showing failure of the tympanic elements to unite. 

Anomalous Kerma Skulls. 

Plate VI 

(a) Male skull (No. 92), with the sockets of two teeth other than molars lacking. 

(b) Female skull (No. 210), with no sockets for lateral incisors. 

(c) Female skull (No. 155), with no sockets for the left canine and right third molar. 

(d) Male skull (No. 107), with sockets for three incisors only. 

Anomalous Palates of Kerma Skulls. 




ON THE LIKELIHOOD THAT ONE UNKNOWN 
PROBABILITY EXCEEDS ANOTHER IN VIEW 
OF THE EVIDENCE OF TWO SAMPLES. 

By WILLIAM R. THOMPSON. From the Department of Pathology, 
Yale University. 

Section 1. 

In elaborating the relations of the present communication interest was not 
centred upon the interpretation of particular data, but grew out of a general 
interest in problems of research planning. From this point of view there can be no 
objection to the use of data, however meagre, as a guide to action required before 
more can be collected; although serious objection can otherwise be raised to argument 
based upon a small number of observations. Indeed, the fact that such objection 
can never be eliminated entirely — no matter how great the number of observations — 
suggested the possible value of seeking other modes of operation than that of taking 
a large number of observations before analysis or any attempt to direct our course. 
This problem is more general than that treated in Section 2, and is directly con- 
cerned with any case where probability criteria may be established by means of 
which we judge whether one mode of operation is better than another in some 
given sense or not. 

Thus, if, in this sense, $P$ is the probability estimate that one treatment of a 
certain class of individuals is better than a second, as judged by data at present 
available, then we might take some monotone increasing function of $P$, say $f(P)$, 
to fix the fraction of such individuals to be treated in the first manner, until more 
evidence may be utilised, where $0 \le f(P) \le 1$; the remaining fraction of such 
individuals ($1 - f(P)$) to be treated in the second manner; or we may establish a 
probability of treatment by the two methods of $f(P)$ and $1 - f(P)$, respectively. If 
such a discipline were adopted, even though it were not the best possible, it seems 
apparent that a considerable saving of individuals otherwise sacrificed to the inferior 
treatment might be effected. This would be important in cases where either the 
rate of accumulation of data is slow or the individuals treated are valuable, or both. 

If we arbitrarily decide to eliminate the second treatment in favour of the first 
at this time, then the expectation of sacrifice to the inferior treatment would be 
$(1 - P)$ for all subsequently treated individuals; whereas, if, for example, we take 
$f(P) = P$, the expectation of such sacrifice would be temporarily 

$$P(1-P) + (1-P)P = 2PQ,$$

where $Q = 1 - P$. Obviously, $2PQ \le \tfrac{1}{2}$ and, if $P \ne \tfrac{1}{2}$, then $2PQ < \tfrac{1}{2}$; whence a saving 
is made in contrast to the so-called alternate case method. In the long run, if a real 
preference exists between the two treatments, the expected saving by continued 
application of this method of apportionment rather than by making immediate final 
decision is sensibly $1 - P$ of individuals subsequently treated. 
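
The comparison of the three rules of apportionment can be sketched numerically. The function name `expected_sacrifice` and the value P = 0.8 are illustrative assumptions, not part of the original text; the sketch only restates the arithmetic above.

```python
def expected_sacrifice(P, f):
    """Chance that the next individual gets the inferior treatment.

    P: probability estimate that the first treatment is the better one.
    f: fraction (or probability) of individuals given the first treatment.
    """
    # The first treatment is inferior with probability 1 - P, the second with P.
    return f * (1 - P) + (1 - f) * P


P = 0.8                                   # illustrative value only
alternate = expected_sacrifice(P, 0.5)    # alternate case method
matched = expected_sacrifice(P, P)        # f(P) = P, giving 2PQ
decided = expected_sacrifice(P, 1.0)      # immediate final decision for the first
print(alternate, matched, decided)
```

With any P other than one half, the rule f(P) = P lies between the alternate case method and the immediate decision, which is the saving the text describes.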

Obviously, if we are to operate in this manner, we need methods of evaluation 
of P for small as well as large numbers of observations. In the latter case many 
approximate methods are available in all fields although bounds to approximation 
have not been considered generally. 

In Section 2 a sampling problem is treated, which is equivalent to a special case, 
where we are to judge between two rival treatments upon the basis of the proba- 
bility of occurrence of a given critical event following such treatment. These 
probabilities are assumed unknown, but denoted by $p_1$ and $p_2$; and it is assumed 
that, independently for each of these, a priori $p_i$ is equally likely to lie in either of 
any two equal intervals in its possible range, (0, 1). Our available experience 
consists solely of the data: 

Of $n_1$ individuals treated by the first method, $r_1$ experienced the critical event 
and $s_1$ did not; and of $n_2$ treated by the second, $r_2$ and $s_2$ were the corresponding 
numbers with respect to the critical event. 

In any given case it must be decided whether these requirements are met or 
not, and whether we may apply the well-known Principle of Bayes to convert the 
problem to the form of Section 2. Statistical criteria are often employed, however, 
in situations in which certain deviations from the conditions required in their 
development can be tolerated, when a better procedure is not available. 


Section 2. 


Consider the case of two infinite populations for which the unknown probabilities 
of occurrence of a given critical event are $p_1$ and $p_2$, and the probability of obtaining 
a sample containing $r$ critical occurrences and $s$ failures in $n = r + s$ trials is 
$\binom{n}{r} p_i^{\,r} (1 - p_i)^{s}$, where $i = 1, 2$, respectively. Furthermore, assume that one sample 
has been drawn at random from each population, the respective values of $r$ and $s$ 
being denoted by $r_i$ and $s_i$ (where $i = 1, 2$) and $n_i = r_i + s_i$; and that independently 
for $i = 1$ or $2$ the probability that $p_i$ lies in the interval $(p, p + dp)$ is $dP_{p_i<p}$, where 

$$P_{p_i<p} = \frac{\int_0^p p^r q^s \, dp}{\int_0^1 p^r q^s \, dp}, \tag{1}$$

where $q = 1 - p$, $r = r_i$, and $n = r + s$. Then 

$$P_{p_i<p} = \sum_{\alpha = r+1}^{n+1} \binom{n+1}{\alpha} p^{\alpha} q^{\,n+1-\alpha}, \tag{2}$$

the last expression having been indicated by K. Pearson* in this relation. In the 
notation employed by him and by Müller† we may write 

$$P_{p_i<p} = \frac{B_p(u, v)}{B_1(u, v)} = I_p(u, v), \tag{3}$$

where $u = r_i + 1$ and $v = s_i + 1$. The object of the present communication is to give 
a reduced‡ rational algebraic evaluation of the probability ($P_{p_2>p_1}$) that for the 
postulated systems $p_2$ exceed $p_1$, and to indicate certain relations between its value 
(later designated by $\psi_{(r_1, s_1, r_2, s_2)}$) and the sum of $r_2 + 1$ terms of a hypergeometric 
series which has appeared in the work of K. Pearson§|| as well as in the 
Incomplete B- and I-functions*† of (3). 


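Obviously, we may write 

$$
\begin{aligned}
P_{p_2>p_1} &= \int_0^1 \frac{p^{r_1} q^{s_1}}{\int_0^1 p^{r_1} q^{s_1}\,dp}\, \sum_{\alpha=0}^{r_2} \binom{n_2+1}{\alpha} p^{\alpha} q^{\,n_2+1-\alpha}\, dp && \text{(i)}\\
&= \frac{(n_1+1)!}{r_1!\, s_1!} \int_0^1 \sum_{\alpha=0}^{r_2} \binom{n_2+1}{\alpha} p^{\,r_1+\alpha} q^{\,s_1+n_2+1-\alpha}\, dp && \text{(ii)}\\
&= \frac{(n_1+1)!}{r_1!\, s_1!} \sum_{\alpha=0}^{r_2} \binom{n_2+1}{\alpha} \frac{(r_1+\alpha)!\,(s_1+n_2+1-\alpha)!}{(n_1+n_2+2)!} && \text{(iii)}\\
&= \frac{(n_1+1)!\,(n_2+1)!}{(n_1+n_2+2)!} \sum_{\alpha=0}^{r_2} \frac{(r_1+\alpha)!\,(s_1+n_2+1-\alpha)!}{r_1!\,\alpha!\; s_1!\,(n_2+1-\alpha)!} && \text{(iv)}\\
&= \frac{(n_1+1)!\,(n_2+1)!}{(n_1+n_2+2)!} \sum_{\alpha=0}^{r_2} \frac{(r_1+r_2-\alpha)!\,(s_1+s_2+1+\alpha)!}{r_1!\,(r_2-\alpha)!\; s_1!\,(s_2+1+\alpha)!} && \text{(v)}
\end{aligned}
\tag{4}
$$

whence we have 

$$P_{p_2>p_1} = \frac{\displaystyle\sum_{\alpha=0}^{r_2} \binom{r_1+r_2-\alpha}{r_1} \binom{s_1+s_2+1+\alpha}{s_1}}{\displaystyle\binom{n_1+n_2+2}{n_1+1}}, \tag{5}$$

where, of course, $n_i = r_i + s_i$. 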


Now, it is obvious that $P_{p_2>p_1} = P_{p_1<p_2} = P_{q_2<q_1} = P_{q_1>q_2}$, where $q_i = 1 - p_i$; and 
thus in similar manner we have 

$$P_{p_2>p_1} = \frac{\displaystyle\sum_{\alpha=0}^{s_1} \binom{s_1+s_2-\alpha}{s_2} \binom{r_1+r_2+1+\alpha}{r_2}}{\displaystyle\binom{n_1+n_2+2}{n_1+1}}. \tag{6}$$

Furthermore, $P_{p_2>p_1} = 1 - P_{p_1>p_2}$, as the probability that $p_1$ is exactly equal to $p_2$ is 
zero by hypothesis. Hence we have two other similar sums which may be used with 
this difference relation to evaluate the probability under consideration. 
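
As a check on (5) and (6), the two sums can be evaluated exactly with rational arithmetic. The following is a minimal sketch; the function names `psi5` and `psi6` are ours, and the formulas are coded as read from (5) and (6), so the agreement of the two is itself part of the check.

```python
from fractions import Fraction
from math import comb


def psi5(r1, s1, r2, s2):
    """Probability that p2 exceeds p1, evaluated by the sum in (5)."""
    n1, n2 = r1 + s1, r2 + s2
    num = sum(comb(r1 + r2 - a, r1) * comb(s1 + s2 + 1 + a, s1)
              for a in range(r2 + 1))
    return Fraction(num, comb(n1 + n2 + 2, n1 + 1))


def psi6(r1, s1, r2, s2):
    """The same probability, evaluated by the alternative sum in (6)."""
    n1, n2 = r1 + s1, r2 + s2
    num = sum(comb(s1 + s2 - a, s2) * comb(r1 + r2 + 1 + a, r2)
              for a in range(s1 + 1))
    return Fraction(num, comb(n1 + n2 + 2, n1 + 1))


# One success in one trial for the first treatment, one failure for the second.
print(psi5(1, 0, 0, 1), psi6(1, 0, 0, 1))
# The difference relation: the two orderings of the treatments are complementary.
print(psi5(3, 2, 1, 4) + psi5(1, 4, 3, 2))
```

Whichever of the two sums has fewer terms (the smaller of $r_2 + 1$ and $s_1 + 1$) may be preferred in hand computation.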


* Pearson, Karl: Biometrika, Vol. XVI (1924), pp. 202–203. 

† Müller, J. H.: Biometrika, Vol. XXII (1930–31), pp. 284–297. 

‡ The earliest work directed to this end is discussed by Todhunter. Cf. A History of the Mathematical 
Theory of Probability. Cambridge and London (1865), pp. 419–420. 

§ Pearson, Karl: Philosophical Magazine, Series 6, Vol. 13 (1907), pp. 365–378. 

|| Pearson, Karl: Biometrika, Vol. XX A (1928), pp. 149–174. 



In actual evaluation we may make use of the well-known pyramid form of 
tabulation of the binomial coefficients, 

                      1 
                    1   1 
                  1   2   1 
                1   3   3   1 
              1   4   6   4   1 
(7)         1   5  10  10   5   1 
          1   6  15  20  15   6   1 


which is readily constructed by the property that each entry except those on the 
boundaries (which are always unity) is the sum of the two nearest entries of the 
row next above*. The corresponding factors of the successive terms in the sum to 
be evaluated may be found in order on given diagonals, ascending in the case of the 
first and descending in the case of the second factor. The bounding diagonals of 
units may be deleted in practice (as may all entries to the right of the middle 
column). Then the first factor of the first term of the sum in (5) is the $r_2$th entry 
in the row whose first entry (at the left) is $r_1 + r_2$, and the corresponding factors of 
successive terms lie successively above on the diagonal through the first and parallel 
to the left-hand boundary. Similarly, the other factors are found, but proceeding 
in the opposite direction on the appropriate diagonal. 

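Now, for any positive number, $x$, let $Q_{(x)} = x^{\,x+\frac{1}{2}}$. Then, by Stirling’s formula, 
we have 

$$m! = Q_{(m)} \cdot \sqrt{2\pi} \cdot e^{-m} \cdot e^{\theta/12m}, \tag{8}$$

where $0 < \theta < 1$; whence (4)(v) gives 

$$P_{p_2>p_1} = e^{\omega}\, \frac{Q_{(n_1+1)}}{Q_{(n_1+n_2+2)} \cdot Q_{(r_1)} \cdot Q_{(s_1)}} \sum_{\alpha=0}^{r_2} \binom{n_2+1}{r_2-\alpha}\, Q_{(r_1+r_2-\alpha)} \cdot Q_{(s_1+s_2+1+\alpha)}, \tag{9}$$

where 

$$|12\,\omega| < \frac{1}{r_1} + \frac{1}{s_1} + \frac{1}{n_1+1};$$

which may be used to advantage in approximation of $P_{p_2>p_1}$ when $r_1$ and $s_1$ are large 
and $r_2$ small. 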
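
The form of Stirling's formula in (8) can be checked numerically by solving for the correction factor θ at several values of m. This is a sketch only: the function name `theta` is ours, and logarithms (via `math.lgamma`) are used in place of the factorial itself to avoid overflow.

```python
import math


def theta(m):
    """Solve m! = Q(m) * sqrt(2*pi) * exp(-m) * exp(theta / (12 m)) for theta,
    where Q(m) = m ** (m + 1/2), working throughout with logarithms."""
    log_main = (m + 0.5) * math.log(m) + 0.5 * math.log(2 * math.pi) - m
    return 12 * m * (math.lgamma(m + 1) - log_main)


thetas = {m: theta(m) for m in (1, 2, 5, 10, 50)}
for m, t in thetas.items():
    print(m, t)
```

The values found lie between 0 and 1 and approach 1 as m grows, so that the factor $e^{\theta/12m}$ is close to unity for large m.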

Now we may define $\psi_{(r_1, s_1, r_2, s_2)}$ for any four rational integers (not negative) as 
identical with the right member of (6), where $n_i = r_i + s_i \ge 0$. Then we have shown 
several equivalent expressions of the same function and that 

$$\psi_{(r_1, s_1, r_2, s_2)} = \psi_{(s_2, r_2, s_1, r_1)} = 1 - \psi_{(s_1, r_1, s_2, r_2)}. \tag{10}$$

* Glaisher, J. W. L.: “A Table of Binomial-Theorem Coefficients,” Messenger of Mathematics, 
Vol. 47 (1917), pp. 97–107. 



From the conditions stated it may be expected that if we set $p = r_1/n_1$ and $q = 1 - p$, 
then (provided $0 < p < 1$), 

$$\lim_{n_1 \to \infty} \frac{\psi_{(r_1, s_1, r_2, s_2)}}{I_q(s_2+1,\, r_2+1)} = 1. \tag{11}$$

That this is true may be verified if we exclude the cases, $p = 0, 1$. Further bounds 
to approximation of this limit by the ratio ($R$) of these functions for given values 
of $n_1$ may be found as follows: 

By (4)(iv) we may write 

$$\psi_{(r_1, s_1, r_2, s_2)} = \sum_{\alpha=0}^{r_2} \binom{n_2+1}{\alpha} \cdot \frac{(r_1+\alpha)!\,(s_1+n_2+1-\alpha)!}{r_1!\; s_1!} \cdot \frac{(n_1+1)!}{(n_1+n_2+2)!}, \tag{12}$$

and by (2) and (3), introducing the appropriate values of $r = r_2$ and $s = s_2$, 
we have 

$$I_q(s_2+1,\, r_2+1) = \sum_{\alpha=0}^{r_2} \binom{n_2+1}{\alpha}\, p^{\alpha}\, q^{\,n_2+1-\alpha}, \tag{13}$$

where $p = r_1/n_1$ and $q = s_1/n_1$. Obviously, therefore, as all terms of both sums in (12) and 
(13) are positive, if we exclude the special cases where $p$ or $q = 0$, $R$ is bounded by 
the greatest and least values attainable for the ratio of a term in the sum of (12) 
to the corresponding term in (13). Thus we may define 

$$\omega_1 = \min_{\alpha} \left[ \frac{(r_1+\alpha)!\,(s_1+n_2+1-\alpha)!\,(n_1+1)!}{r_1!\; s_1!\,(n_1+n_2+2)!\; p^{\alpha}\, q^{\,n_2+1-\alpha}} \right] \quad\text{and}\quad \omega_2 = \max_{\alpha} \left[ \frac{(r_1+\alpha)!\,(s_1+n_2+1-\alpha)!\,(n_1+1)!}{r_1!\; s_1!\,(n_1+n_2+2)!\; p^{\alpha}\, q^{\,n_2+1-\alpha}} \right] \tag{14}$$

for $0 \le \alpha \le r_2$; and, obviously, then 

$$\omega_1 \le R \le \omega_2, \quad\text{and}\quad \lim_{n_1 \to \infty} [R] = 1. \tag{15}$$

In the excluded cases it is also readily verified that 

$$\lim_{n_1 \to \infty} \left[\psi_{(r_1, s_1, r_2, s_2)}\right] = I_q(s_2+1,\, r_2+1). \tag{16}$$
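
The limit stated in (11) can be illustrated by holding $p = r_1/n_1$ fixed at 0.3 while $n_1$ is increased. In this sketch the names `psi` and `I_q_sum` are ours; `psi` evaluates the sum of (5) exactly, and `I_q_sum` evaluates the binomial sum on the right of (13) in floating point. The particular arguments are illustrative only.

```python
from fractions import Fraction
from math import comb


def psi(r1, s1, r2, s2):
    """Exact probability that p2 exceeds p1, by the sum in (5)."""
    n1, n2 = r1 + s1, r2 + s2
    num = sum(comb(r1 + r2 - a, r1) * comb(s1 + s2 + 1 + a, s1)
              for a in range(r2 + 1))
    return Fraction(num, comb(n1 + n2 + 2, n1 + 1))


def I_q_sum(r2, s2, p):
    """Right member of (13): the first r2 + 1 terms of the binomial (q + p)^(n2+1)."""
    q, n2 = 1 - p, r2 + s2
    return sum(comb(n2 + 1, a) * p**a * q**(n2 + 1 - a) for a in range(r2 + 1))


# r1 = 3k, s1 = 7k keeps p = 0.3 while n1 = 10k grows; r2 = 2, s2 = 3 throughout.
ratios = [float(psi(3 * k, 7 * k, 2, 3)) / I_q_sum(2, 3, 0.3)
          for k in (1, 10, 100, 1000)]
for k, R in zip((1, 10, 100, 1000), ratios):
    print(10 * k, R)
```

The ratio R moves toward unity as $n_1$ increases, as (15) requires.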

The relation of this function to the sum of a given number of consecutive terms 
of a hypergeometric series is particularly interesting in view of the occurrence of 
such series in the investigations of K. Pearson*†. By (4)(iv) we may write: 

$$\psi_{(r_1, s_1, r_2, s_2)} = \frac{(n_1+1)!\,(n_2+1)!}{(n_1+n_2+2)!\; r_1!\; s_1!} \sum_{\alpha=0}^{r_2} \frac{(r_1+\alpha)!\,(s_1+n_2+1-\alpha)!}{\alpha!\,(n_2+1-\alpha)!}, \tag{17}$$

* Pearson, Karl: Philosophical Magazine, Series 6, Vol. 13 (1907), pp. 365–378. 

† Pearson, Karl: Biometrika, Vol. XX A (1928), pp. 149–174. 



which is the sum of the first $r_2 + 1$ terms of a hypergeometric series multiplied by 
a constant. Similarly, we may write 

$$
\begin{aligned}
\psi_{(s_1, r_1, s_2, r_2)} &= \frac{(n_1+1)!\,(n_2+1)!}{(n_1+n_2+2)!\; r_1!\; s_1!} \sum_{\alpha=0}^{s_2} \frac{(s_1+\alpha)!\,(r_1+n_2+1-\alpha)!}{\alpha!\,(n_2+1-\alpha)!}\\
&= \frac{(n_1+1)!\,(n_2+1)!}{(n_1+n_2+2)!\; r_1!\; s_1!} \sum_{\alpha=0}^{s_2} \frac{(r_1+r_2+1+\alpha)!\,(s_1+s_2-\alpha)!}{(r_2+1+\alpha)!\,(s_2-\alpha)!}\\
&= \frac{(n_1+1)!\,(n_2+1)!}{(n_1+n_2+2)!\; r_1!\; s_1!} \sum_{\alpha=r_2+1}^{n_2+1} \frac{(r_1+\alpha)!\,(s_1+n_2+1-\alpha)!}{\alpha!\,(n_2+1-\alpha)!}
\end{aligned}
\tag{18}
$$

obviously (as $n_2 = r_2 + s_2$); and by previous demonstration 

$$\psi_{(r_1, s_1, r_2, s_2)} + \psi_{(s_1, r_1, s_2, r_2)} = 1; \tag{19}$$

whence 

$$\frac{(n_1+n_2+2)!\; r_1!\; s_1!}{(n_1+1)!\,(n_2+1)!} = \sum_{\alpha=0}^{n_2+1} \frac{(r_1+\alpha)!\,(s_1+n_2+1-\alpha)!}{\alpha!\,(n_2+1-\alpha)!}. \tag{20}$$

The last relation, demonstrated above by independent proof, has been 
established previously by Pearson*† (with different notation). Thus we may 
regard $\psi_{(r_1, s_1, r_2, s_2)}$ as defined by the identity 

$$\psi_{(r_1, s_1, r_2, s_2)} = \frac{\displaystyle\sum_{\alpha=0}^{r_2} \frac{(r_1+\alpha)!\,(s_1+n_2+1-\alpha)!}{\alpha!\,(n_2+1-\alpha)!}}{\displaystyle\sum_{\alpha=0}^{n_2+1} \frac{(r_1+\alpha)!\,(s_1+n_2+1-\alpha)!}{\alpha!\,(n_2+1-\alpha)!}}, \tag{21}$$

extending the domain of definition to include the value, $r_2 = n_2 + 1$; but retaining 
the restrictions, $n_i = r_i + s_i \ge 0$, and that $-1$ be the least value of $r_1$, $s_1$, $r_2$, and $s_2$ 
(only one of which shall be admitted to be negative). Then, by this extension, 
we have 

$$\psi_{(r_1, s_1, -1, n_2+1)} \equiv 0, \quad\text{and}\quad \psi_{(r_1, s_1, n_2+1, -1)} \equiv 1, \tag{22}$$

which lie outside the domain of the initial discussion, and we extend to the new 
domain the relation of (10) formally; i.e. 

$$\psi_{(r, s, r', s')} \equiv \psi_{(s', r', s, r)} \equiv 1 - \psi_{(s, r, s', r')}. \tag{23}$$
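
The definition by ratio of sums in (21) can be tried on the extended domain directly. The sketch below is ours (the names `term` and `psi21` are not from the text), and it takes the boundary values to be 0 when $r_2 = -1$ and 1 when $s_2 = -1$, which is our reading of (22); both follow from the two sums themselves. Only $r_2$ or $s_2$ may be $-1$ in this sketch.

```python
from fractions import Fraction
from math import factorial


def term(r1, s1, n2, a):
    """General term of the sums in (21)."""
    return Fraction(factorial(r1 + a) * factorial(s1 + n2 + 1 - a),
                    factorial(a) * factorial(n2 + 1 - a))


def psi21(r1, s1, r2, s2):
    """psi as the ratio of the partial sum (a = 0..r2) to the full sum (a = 0..n2+1)."""
    n2 = r2 + s2
    top = sum((term(r1, s1, n2, a) for a in range(r2 + 1)), Fraction(0))
    full = sum((term(r1, s1, n2, a) for a in range(n2 + 2)), Fraction(0))
    return top / full


print(psi21(4, 2, -1, 3))   # empty partial sum
print(psi21(4, 2, 3, -1))   # partial sum coincides with the full sum
print(psi21(4, 2, 0, 0))    # second treatment untried
```

When the second treatment is untried the value reduces to $(s_1+1)/(n_1+2)$, which serves as a further check on the definition.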

K. Pearson† has considered the problem of likelihood of various values of $R$ and 
$S$, the number of marked members and unmarked members, respectively, in a finite 
universe of aggregate number, $N = R + S$; assuming $N$ fixed and all values of 
$R, S \ge 0$ equally likely a priori and that our sole experience from which judgment 
is to be made is that a random sample has been drawn containing exactly $r$ marked 
and $s$ unmarked members ($R$ and $S$ being used here in place of Pearson’s $p$ and $q$ 
to avoid confusion). Then by (iii) and (iv) of the article† just mentioned, we have a 
means of evaluating the probability, $P_R$, that the universe contains no more than 
$R$ marked members by the relation, 

$$P_R = \psi_{(r,\; s,\; R-r,\; N-R-s-1)}, \tag{24}$$

* Pearson, Karl: Philosophical Magazine, Series 6, Vol. 13 (1907), pp. 365–378. 
† Pearson, Karl: Biometrika, Vol. XX A (1928), pp. 149–174. 



which may be verified readily. Similarly, in the case of the problem considered 
earlier by Pearson*, having drawn one random sample from a certain infinite 
population, the sample containing exactly $r'$ marked and $s'$ unmarked members, we 
are required to find the probability (under the given conditions) that if we draw 
another random sample of $n''$ individuals from the same population it will contain 
no more than $r''$ marked members; the required value is given for $r'' \le n''$ by 

$$\psi_{(r',\; s',\; r'',\; n''-r''-1)}.$$
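
Both problems of Pearson referred to here can be checked against direct enumeration. This is a sketch under stated assumptions: the finite-universe case uses the arguments of (24); for the second problem the arguments $(r', s', r'', n''-r''-1)$ are our own reading, tested below against the direct calculation rather than taken from the text. All function names are ours.

```python
from fractions import Fraction
from math import comb, factorial


def psi(r1, s1, r2, s2):
    """Sum of (5); it remains usable when s2 = -1."""
    n1, n2 = r1 + s1, r2 + s2
    num = sum(comb(r1 + r2 - a, r1) * comb(s1 + s2 + 1 + a, s1)
              for a in range(r2 + 1))
    return Fraction(num, comb(n1 + n2 + 2, n1 + 1))


def P_R_direct(N, r, s, R):
    """Finite universe of N with all compositions equally likely a priori:
    chance of no more than R marked members, given r marked and s unmarked drawn."""
    w = {k: comb(k, r) * comb(N - k, s) for k in range(r, N - s + 1)}
    return Fraction(sum(v for k, v in w.items() if k <= R), sum(w.values()))


def beta_int(a, b):
    """Integral of p^a (1-p)^b over (0, 1) for non-negative integers a, b."""
    return Fraction(factorial(a) * factorial(b), factorial(a + b + 1))


def predictive_direct(r1, s1, n2, r2):
    """Chance that a second sample of n2 holds no more than r2 marked members."""
    return sum(comb(n2, k) * beta_int(r1 + k, s1 + n2 - k)
               for k in range(r2 + 1)) / beta_int(r1, s1)


print(psi(2, 3, 4, 2), P_R_direct(12, 2, 3, 6))
print(psi(3, 2, 2, 2), predictive_direct(3, 2, 5, 2))
```

In both cases the largest admissible argument makes the fourth argument of ψ equal to $-1$, which is where the extended definition is needed.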

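In the tabulation of values of $\psi_{(r, s, r', s')}$ for ascending values of the arguments 
the work may be greatly simplified by certain relations in addition to those given 
in (22) and (23) in much the same manner as the binomial coefficients may be 
tabulated by mere summation of two values already given. To this end let us 
examine two functions defined by 

$$N_{(r, s, r', s')} = \sum_{\alpha=0}^{r'} \binom{r+r'-\alpha}{r} \binom{s+s'+1+\alpha}{s}, \tag{25}$$

and $D_{(r, s, r', s')} = D_{(n, n')} = \binom{n+n'+2}{n+1}$, for $n = r + s$, and $n' = r' + s'$. 

Obviously, by the original definition of $\psi$ extended in (21), (22) and 
(23), then 

$$\psi_{(r, s, r', s')} = \frac{N_{(r, s, r', s')}}{D_{(r, s, r', s')}}, \tag{26}$$

where we extend the definition of (25) for $r, s, r', s' \ge 0$ by 

$$N_{(r, s, -1, s')} = 0, \quad\text{and}\quad N_{(r, s, r', -1)} = D_{(r+s,\; r'-1)}, \tag{27}$$

and by (26) and (23) we have 

$$N_{(r, s, r', s')} \equiv N_{(s', r', s, r)} \equiv D_{(r+s,\; r'+s')} - N_{(s, r, s', r')}, \tag{28}$$

as it is obvious that $D_{(n, n')} = D_{(n', n)}$. Furthermore, by the well-known relation†, 

$$\binom{m}{k} = \binom{m-1}{k} + \binom{m-1}{k-1}, \qquad D_{(n, n')} \equiv D_{(n, n'-1)} + D_{(n-1, n')}, \tag{29}$$

we have for $\alpha \ge 0$ in (25) 

$$\binom{r+r'-\alpha}{r} \binom{s+s'+1+\alpha}{s} = \binom{r+r'-\alpha}{r} \left[ \binom{s+s'+\alpha}{s-1} + \binom{s+s'+\alpha}{s} \right], \tag{30}$$

whence we have in any case under its definition, obviously, by (20), (27) and (28), 
the identities 

$$N_{(r, s, r', s')} \equiv N_{(r, s-1, r', s')} + N_{(r, s, r', s'-1)} \equiv N_{(r-1, s, r', s')} + N_{(r, s, r'-1, s')}. \tag{31}$$

By (25) and (29), obviously, the same relation holds for the D-function; and 
we may write 

$$\psi_{(r, s, r', s')} = \frac{N_{(r, s-1, r', s')} + N_{(r, s, r', s'-1)}}{D_{(r, s-1, r', s')} + D_{(r, s, r', s'-1)}}. \tag{32}$$

* Pearson, Karl: Philosophical Magazine, Series 6, Vol. 13 (1907), pp. 365–378. 
† Glaisher, J. W. L.: “A Table of Binomial-Theorem Coefficients,” Messenger of Mathematics, Vol. 47 
(1917), pp. 97–107. 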
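
The tabulation by addition described by (31) can be sketched as a recursion on $s$ and $s'$. The names `N_rec` and `N_direct` are ours; the boundary value for $s' = -1$ is taken from (27), and the boundary value for $s = -1$ is obtained from (27) by the symmetry of (28), which is an assumption of this sketch rather than a statement of the text.

```python
from functools import lru_cache
from math import comb


def D(n, n2):
    """D(n, n') = C(n + n' + 2, n + 1)."""
    return comb(n + n2 + 2, n + 1)


def N_direct(r, s, r2, s2):
    """N by its defining sum (25), for non-negative arguments."""
    return sum(comb(r + r2 - a, r) * comb(s + s2 + 1 + a, s) for a in range(r2 + 1))


@lru_cache(maxsize=None)
def N_rec(r, s, r2, s2):
    """N built by additions only, using the first identity of (31)."""
    if s == -1:
        return 0                      # from (27), transposed by the symmetry (28)
    if s2 == -1:
        return D(r + s, r2 - 1)       # from (27)
    return N_rec(r, s - 1, r2, s2) + N_rec(r, s, r2, s2 - 1)


print(N_rec(1, 0, 0, 1), D(1, 1))     # numerator and denominator of psi(1, 0, 0, 1)
```

Each value is the sum of two values with smaller arguments, exactly as in the pyramid of binomial coefficients; the required probability is then the ratio N/D.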



By means of these relations it is evident that it suffices to tabulate the N- and 
the D-functions by additions of corresponding pairs of values already listed or 
readily obtained by the relations of (28) if we proceed from the lowest values of 
the arguments upward; and we need list values only for the cases where $r + s \le r' + s'$ 
and where all four variables may be restricted to positive values, as by (25) 
we have 
(33) 

By (25) we may write 

$$N_{(r, s, r', s')} = \sum_{\alpha=0}^{r'} D_{(r-1,\; r'-\alpha-1)} \cdot D_{(s-1,\; s'+\alpha)}, \tag{34}$$

which is of value if $D_{(n, n')}$ is tabulated through ascending values of $n$ 
and $n'$. This may be done rapidly by means of (29) and the relation $D_{(n, n')} = D_{(n', n)}$, 
and is more convenient than the pyramid form of the corresponding binomial 
coefficients. In the tabulation we may restrict $n$ and $n'$ to the positive integers, 
employing the relation $D_{(n, 0)} = n + 2$. A short table of the N and D functions is 
appended as an illustration, the required probability being the ratio of these 
corresponding values. 
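
The evaluation of N from a table of D alone, as in (34), may be sketched as follows. The names `N_by_D` and `N_by_sum` are ours; D is computed here from its binomial form rather than read from a table, and the value $D = 1$ when either argument is $-1$ follows from that form.

```python
from math import comb


def D(n, n2):
    """D(n, n') = C(n + n' + 2, n + 1); equals 1 when either argument is -1."""
    return comb(n + n2 + 2, n + 1)


def N_by_sum(r, s, r2, s2):
    """N by the defining sum (25)."""
    return sum(comb(r + r2 - a, r) * comb(s + s2 + 1 + a, s) for a in range(r2 + 1))


def N_by_D(r, s, r2, s2):
    """N as a sum of products of two D values, relation (34)."""
    return sum(D(r - 1, r2 - a - 1) * D(s - 1, s2 + a) for a in range(r2 + 1))


r, s, r2, s2 = 3, 2, 2, 4                     # illustrative arguments only
print(N_by_D(r, s, r2, s2), D(r + s, r2 + s2))
```

The two numbers printed are the numerator and denominator of the required probability for the illustrative arguments.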

Section 3. 


If a system of operation such as suggested in Section 1 were adopted extensively 
for the case considered in Section 2, reference to values of $\psi_{(r, s, r', s')}$ for small values 
of the arguments should be required frequently; and, accordingly, a simple method 
of formation of a table of these would be valuable. The method given in Section 2 
seems to serve this purpose; and, in conjunction with the relations of (28), many 
values need not be listed. The short table is given merely in illustration. It really 
deals completely with all cases of $n, n' \le 5$ although certain cases are not listed where 
the values are readily obtainable from those given and (28). The several general 
evaluations at the head of the table would permit deletion of many more, e.g. any 
instance where one of the four arguments is zero; but they have been retained for 
illustrative purposes. 


The function, $D_{(n, n')} \equiv \binom{n+n'+2}{n+1}$, is readily tabulated in a convenient form 
for increasing values of $n \ge n' \ge 1$, as has been mentioned under (34) above, by 
adding to $n + 2$ successively the values of $D_{(n-1, n')}$ already listed, and taking a sub-total 
after each addition, and finally doubling the last sub-total. These sub-totals 
and the final double are the required successive values of $D_{(n, n')}$. The value of such 
a table extends far beyond that of the immediate problem; and, by means of it and 
relation (34), $\psi_{(r, s, r', s')}$ may be calculated rapidly or approximated with any required 
precision for considerably higher values of the arguments than it may be convenient 
to have tabulated $\psi_{(r, s, r', s')}$. In accord with some prescribed tolerance and limited 
extent, a table of approximate values of the D-function could be made with greater 
ease, but apparently not readily extensible within the same relative tolerance 
without revision. Just how far these tables should extend would depend upon 
demands for their use. All questions as to approximate methods should be decided 
by several statisticians in consultation at a time when a definite programme for the 
use of these methods is formed. 

Short Table of $N_{(r, s, r', s')}$ and $D_{(n, n')}$ ($n = r + s$, and $n' = r' + s'$) 
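
The procedure of sub-totals for the D-function can be sketched as below. The function name `d_row` and the list layout are our own assumptions; each row holds $D_{(n,1)}, \ldots, D_{(n,n)}$ and is formed only by additions from the row before, with a final doubling.

```python
from math import comb


def d_row(n, prev_row):
    """Return [D(n,1), ..., D(n,n)] from prev_row = [D(n-1,1), ..., D(n-1,n-1)]."""
    total = n + 2              # D(n, 0)
    row = []
    for value in prev_row:     # add D(n-1, n') for n' = 1, ..., n-1 in turn
        total += value
        row.append(total)      # each sub-total is D(n, n')
    row.append(2 * total)      # doubling the last sub-total gives D(n, n)
    return row


rows, prev = {}, []
for n in range(1, 9):
    prev = d_row(n, prev)
    rows[n] = prev
    print(n, prev)
```

Values with $n' > n$ need not be formed separately, since $D_{(n, n')} = D_{(n', n)}$.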

In (24) and the paragraph in which it stands is given the relation between the 
hypergeometrical series studied by Pearson*† and $\psi_{(r, s, r', s')}$, by means of which it 
is obvious that any approximation methods valid for estimation of $P_R$ in (24) are 
equally valid for the estimation of the corresponding ψ-function, several of which 
have been suggested by K. Pearson. The I-function of Pearson is related to the 
ψ-function by (11) to (15) also; and another approximation of $\psi_{(r_1, s_1, r_2, s_2)}$ is given 
in (9) with indicated domain of validity. 

A further treatment of the ψ-function and the method of apportionment will 
be provided in a later paper. 

* Pearson, Karl: Philosophical Magazine, Series 6, Vol. 13 (1907), pp. 365–378. 
† Pearson, Karl: Biometrika, Vol. XX A (1928), pp. 149–174. 



ON ASYMPTOTIC FORMULAE FOR THE
HYPERGEOMETRIC SERIES.

I. HYPERGEOMETRIC SERIES IN WHICH THE FOURTH
ELEMENT, x, IS UNITY.

By O. L. DAVIES, M.Sc.


The hypergeometric series $F(\alpha, \beta, \gamma, 1)$ arises frequently in problems of chance
when samples are taken from a finite population. For instance, in a population of
size $M$ in which $p$ individuals possess a certain character A, and $q$ ($= M - p$), not
A, the chance of drawing a sample of size $n$ in which there are $r$ of A and $s$ ($= n - r$),
not A, is clearly

$$\frac{n!}{r!\,s!}\cdot\frac{p}{M}\cdot\frac{p-1}{M-1}\cdots\frac{p-r+1}{M-r+1}\cdot\frac{q}{M-r}\cdot\frac{q-1}{M-r-1}\cdots\frac{q-s+1}{M-n+1} \qquad (1).$$

Hence the distribution of $r$ and $s$ in repeated samples of size $n$ is given by the
terms of the series

$$N\,\frac{(M-n)!\,p!}{M!\,(p-n)!}\left[1+\frac{n\,q}{1!\,(p-n+1)}+\frac{n(n-1)\,q(q-1)}{2!\,(p-n+1)(p-n+2)}+\cdots\right] \qquad (2),$$

where $N$ is the number of samples. If we write $n = -\alpha$, $q = -\beta$, $p + 1 - n = \gamma$, this
series can be transformed into the usual hypergeometric form

$$N\,\frac{(\gamma-\beta-1)!\,(\gamma-\alpha-1)!}{(\gamma-\alpha-\beta-1)!\,(\gamma-1)!}\left[1+\frac{\alpha\beta}{1!\,\gamma}+\frac{\alpha(\alpha+1)\beta(\beta+1)}{2!\,\gamma(\gamma+1)}+\cdots\right]
= N\,\frac{(\gamma-\beta-1)!\,(\gamma-\alpha-1)!}{(\gamma-\alpha-\beta-1)!\,(\gamma-1)!}\,F(\alpha,\beta,\gamma,1) \qquad (3).$$


The chance of drawing in a sample of size n at least r individuals possessing 
character A is equal to the sum of the first (r+ 1) terms of (2). This is what is 
meant by the probability integral of the series; its determination is of paramount 
importance. 
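The chance (1) and the probability integral just described may be illustrated numerically. The following is only a minimal sketch: the function names and the values M = 50, p = 20, n = 10 are hypothetical, chosen for illustration, and the chance is written in the equivalent binomial-coefficient form rather than as the product in (1).

```python
from math import comb

def chance_of_r(M, p, n, r):
    """Chance of r individuals of A (and s = n - r of not-A) in a sample of n
    drawn without replacement from M individuals, p of which possess A."""
    q = M - p
    s = n - r
    return comb(p, r) * comb(q, s) / comb(M, n)

def chance_at_least(M, p, n, r_min):
    """Probability integral: chance of at least r_min individuals of A."""
    return sum(chance_of_r(M, p, n, r) for r in range(r_min, n + 1))

# Hypothetical population and sample sizes, for illustration only.
M, p, n = 50, 20, 10
total = sum(chance_of_r(M, p, n, r) for r in range(n + 1))
tail = chance_at_least(M, p, n, 6)
print(f"sum over all r = {total:.12f}; chance of at least 6 of A = {tail:.6f}")
```

The binomial-coefficient form agrees with (1) term by term; for example, with M = 10, p = 4, n = 3, r = 2 both give 3 × (4/10)(3/9)(6/8) = 0.3.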

It is well known that, when $M$ is infinite, and if $p$ and $q$ now measure the chance
of an individual being A or not A respectively, the distribution of $r$ and $s$ in
repeated samples of size $n$ is given by the terms of the expansion of the binomial
$N(p + q)^n$. Its probability integral has been expressed by an incomplete Beta
function*. When neither $p$ nor $q$ is very small and $n$ fairly large, the distribution
may be fitted very closely by a normal curve. If, however, either $p$ or $q$ is very
small, the distribution may be represented by a Poisson series. This may also be


* Karl Pearson, Biometrika, Vol. xvi. pp. 202–3.





fitted quite closely by a normal curve if $n$ be sufficiently large. The probability
integrals of both these distributions, normal and Poisson, have been tabulated*.

Hypergeometric series of type $F(\alpha, \beta, \gamma, 1)$ may arise in at least two other
ways†, namely:


(i) The proportional frequencies of drawing in successive samples of a fixed
size $n$, $r$ marked and $s$ ($= n - r$) unmarked individuals ($r = 0, 1, 2, 3, \ldots, n$), from
a population having previously drawn a sample of size $N$ with $p$ marked and
$q$ ($= N - p$) unmarked individuals, are given by the successive terms of the
hypergeometric series

$$F(\alpha,\beta,\gamma,1),\quad\text{where}\quad \alpha=-n,\quad \beta=q+1,\quad \gamma=-(p+n),\quad \epsilon=\gamma-\alpha-\beta-1=-(N+2).$$

(ii) A sample of size $n$ with $r$ marked and $s$ ($= n - r$) unmarked individuals is
taken from a finite population of size $N$. The likelihoods of $N$ having $p$ marked
($p = 0, 1, 2, \ldots, N$) and $q$ ($= N - p$) unmarked individuals are given by the successive
terms of the hypergeometric series

$$cF(\alpha,\beta,\gamma,1),$$

where $\alpha = r + 1$, $\beta = -(N - n)$, $\gamma = -(N - r)$,

$$\epsilon=\gamma-\alpha-\beta-1=-(n+2),$$

$$F(\alpha,\beta,\gamma,1)=\frac{\Gamma(\alpha-\gamma+1)\,\Gamma(\beta-\gamma+1)}{\Gamma(\alpha+\beta-\gamma+1)\,\Gamma(1-\gamma)}.$$

An asymptotic expression for the remainder of the series $F(\alpha, \beta, \gamma, 1)$ after
the $(s + 1)$st term has been provided by M. J. M. Hill‡. It may be written in the
form


where 


It (a, ft > 7> 8 ) 


n (s + 1 , 7 - 1 ) (* + l) a ^-y/(a. ft, 7 , *) 


/(«, ft 7> *) 


n , (7 -<*) (7 - ft 


, (y-a)(y -<* + l)(y-ft) ( 7 -ft + l) 

(7 - a — /8 + 1) (7 “ « - ft + 2) (7 4* 8 + 1) (7 4* 8 + 2) 

I have applied this formula to a number of particular hypergeometric series and
found that in most cases the series $f(\alpha, \beta, \gamma, s)$ did not converge with sufficient
rapidity to be of practical use for finding the sum of a number of significant terms
of the hypergeometric series. However, the formula was very useful for evaluating
the extreme tail of the series.


Professor Burton H. Camp§ has introduced a method for the approximate 
determination of the tail of a frequency distribution, continuous or discrete. His 


* Tables for Statisticians and Biometricians, Part I, Tables II and LI, LII.

† Karl Pearson, Philosophical Magazine, March, 1907, pp. 865–878; Biometrika, Vol. v. 1907,
pp. 172–175, Vol. xiii. 1920, pp. 1–16, Vol. xxA. 1928, pp. 149–174.

‡ Proc. Lond. Math. Soc. Ser. 2, Vol. 6.

§ Biometrika, Vol. xvi. p. 168.





formula has been applied* to the series $F(\alpha, \beta, \gamma, 1)$, but it fails to give good results
when the "stump" lies within $\pm 2\sigma$ of the mode, $\sigma$ being the standard deviation of
the series whose discrete terms are treated as a frequency distribution.

In this paper we shall obtain approximations to the sum of a finite number of
terms of the series $F(\alpha, \beta, \gamma, 1)$ by fitting to it a Pearson-type curve. The question
will be investigated of how closely the probability integral of the series may be
represented by the probability integral of a Pearson-type curve.


The series

$$F(\alpha,\beta,\gamma,1)=1+\frac{\alpha\beta}{1!\,\gamma}+\frac{\alpha(\alpha+1)\beta(\beta+1)}{2!\,\gamma(\gamma+1)}+\cdots,$$

when infinite, is convergent as long as $\gamma > \alpha + \beta$. The $s$th moment of the terms of
the series, represented by a histogram with grouping unit "$c$," about a point $c$ before
the midordinate of the first block is

$$\left(1\cdot 1^s+\frac{\alpha\beta}{1!\,\gamma}\,2^s+\frac{\alpha(\alpha+1)\beta(\beta+1)}{2!\,\gamma(\gamma+1)}\,3^s+\cdots\right)c^s.$$

Applying Raabe's test, the condition of convergence of this more general series
is found to be $\gamma > \alpha + \beta + s$, or $\epsilon > s - 1$, where $\epsilon = \gamma - \alpha - \beta - 1$. We cannot,
therefore, have an infinite hypergeometric series with all its moments finite.

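Convenient expressions for the moment coefficients of the above series, finite or
infinite, have been found by Professor Karl Pearson. They are

$$N=\text{sum of series}=\frac{\Gamma(\gamma)\,\Gamma(\gamma-\alpha-\beta)}{\Gamma(\gamma-\alpha)\,\Gamma(\gamma-\beta)},\qquad \epsilon>0,$$

$$\phantom{N=\text{sum of series}}=\frac{\Gamma(\alpha-\gamma+1)\,\Gamma(\beta-\gamma+1)}{\Gamma(\alpha+\beta-\gamma+1)\,\Gamma(1-\gamma)},\qquad \epsilon<0,$$

$$\mu_1'=c\left(1+\frac{\alpha\beta}{\epsilon}\right),$$

$$\nu_2=\frac{c^2\,\alpha\beta(\alpha+\epsilon)(\beta+\epsilon)}{\epsilon^2(\epsilon-1)},$$

$$\nu_3=\frac{c^3\,\alpha\beta(\alpha+\epsilon)(\beta+\epsilon)(2\alpha+\epsilon)(2\beta+\epsilon)}{\epsilon^3(\epsilon-1)(\epsilon-2)},$$

$$\nu_4=\frac{c^4\,\alpha\beta(\alpha+\epsilon)(\beta+\epsilon)}{\epsilon^4(\epsilon-1)(\epsilon-2)(\epsilon-3)}\Big[\epsilon^3(\epsilon+1)+6\epsilon^2\{\alpha(\alpha+\epsilon)+\beta(\beta+\epsilon)\}+3(6+\epsilon)\,\alpha(\alpha+\epsilon)\beta(\beta+\epsilon)\Big].$$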

We will now test the goodness of fit of the Pearson curve $P(x)$ to $F(\alpha, \beta, \gamma, 1)$,
where $P$ and $F$ have the same first four moments.
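The moment coefficients of the series may be evaluated directly from its constants. The sketch below assumes the standard closed forms in terms of ε = γ − α − β − 1, with ν₂, ν₃, ν₄ the uncorrected moments about the mean; the function name and the returned dictionary are hypothetical. The constants α = 10, β = 30, γ = 101 are those of Example I later in the paper, whose quoted crude values the sketch reproduces.

```python
from math import lgamma, log

def series_constants(alpha, beta, gamma, c=1.0):
    """Uncorrected moments and betas of F(alpha, beta, gamma, 1) for positive
    constants; eps > 3 is required for the first four moments to be finite."""
    eps = gamma - alpha - beta - 1
    if eps <= 3:
        raise ValueError("eps must exceed 3 for a four-moment fit")
    A = alpha * (alpha + eps)
    B = beta * (beta + eps)
    nu2 = c**2 * A * B / (eps**2 * (eps - 1))
    nu3 = c**3 * A * B * (2 * alpha + eps) * (2 * beta + eps) / (
        eps**3 * (eps - 1) * (eps - 2))
    nu4 = c**4 * A * B * (
        eps**3 * (eps + 1) + 6 * eps**2 * (A + B) + 3 * (eps + 6) * A * B
    ) / (eps**4 * (eps - 1) * (eps - 2) * (eps - 3))
    # log10 of the sum of the series, from Gauss's theorem (eps > 0).
    log10_N = (lgamma(gamma) + lgamma(gamma - alpha - beta)
               - lgamma(gamma - alpha) - lgamma(gamma - beta)) / log(10)
    return {
        "eps": eps,
        "log10_N": log10_N,
        "mean_index": alpha * beta / eps,  # mean, counting the first term as 0
        "nu2": nu2,
        "beta1": nu3**2 / nu2**3,
        "beta2": nu4 / nu2**2,
    }

ex1 = series_constants(10, 30, 101)
for name, value in ex1.items():
    print(f"{name:>10s} = {value:.9f}")
```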


Ia. Infinite Hypergeometric Series, $\alpha$, $\beta$, $\gamma$ positive.

Since we are to fit by the first four moments, we must initially assume $\epsilon > 3$ to
ensure all expressions finite.

* Biometrika, Vol. xvii. p. 61.


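The first two betas of a hypergeometric series in which $x$ is unity, have the
following form:

$$\beta_1=\frac{(\epsilon-1)(2\alpha+\epsilon)^2(2\beta+\epsilon)^2}{(\epsilon-2)^2\,\alpha\beta(\alpha+\epsilon)(\beta+\epsilon)}
=\frac{\epsilon-1}{(\epsilon-2)^2}\left(4+\frac{\epsilon^2}{\alpha(\alpha+\epsilon)}\right)\left(4+\frac{\epsilon^2}{\beta(\beta+\epsilon)}\right),$$

$$\beta_2=\frac{3(\epsilon-1)(\epsilon+6)}{(\epsilon-2)(\epsilon-3)}+\frac{\epsilon^2(\epsilon-1)}{(\epsilon-2)(\epsilon-3)}\left[\frac{\epsilon(\epsilon+1)}{\alpha(\alpha+\epsilon)\beta(\beta+\epsilon)}+6\left\{\frac{1}{\alpha(\alpha+\epsilon)}+\frac{1}{\beta(\beta+\epsilon)}\right\}\right].$$

$\beta_1$ and $\beta_2$ are functions of the three parameters $\alpha$, $\beta$ and $\epsilon$, being symmetrical with
respect to $\alpha$ and $\beta$. As either $\alpha$ or $\beta$ increases, both $\beta_1$ and $\beta_2$ decrease. When
$\alpha$ and $\beta$ are large with respect to $\epsilon$, i.e., when $\gamma$ approaches the value $(\alpha + \beta)$, $\beta_1$
and $\beta_2$ approach the simple expressions

$$\beta_1=\frac{16(\epsilon-1)}{(\epsilon-2)^2},\qquad \beta_2=\frac{3(\epsilon-1)(\epsilon+6)}{(\epsilon-2)(\epsilon-3)}.$$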

The criterion $\kappa$, which aids in the determination of the type of a curve, is

$$\kappa=\frac{\beta_1(\beta_2+3)^2}{4(2\beta_2-3\beta_1-6)(4\beta_2-3\beta_1)}.$$

For the above values of $\beta_1$ and $\beta_2$, $\kappa$ is unity. Hence, these two betas trace out
a line on the $\beta_1$, $\beta_2$ plane which coincides with the Type V line. When $\epsilon$ is fairly
large, but still small compared with $\alpha$ and $\beta$, the distribution represented by the
terms of the hypergeometric series tends to become normal.

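Substituting the general values of $\beta_1$ and $\beta_2$ in the expression $2\beta_2 - 3\beta_1 - 6$ and
simplifying, we have

$$2\beta_2-3\beta_1-6=\frac{\epsilon^2(\epsilon-1)}{(\epsilon-2)^2(\epsilon-3)}\left[\frac{12}{\epsilon-1}+12\left\{\frac{1}{\alpha(\alpha+\epsilon)}+\frac{1}{\beta(\beta+\epsilon)}\right\}-\frac{\epsilon(\epsilon^2-7\epsilon+4)}{\alpha(\alpha+\epsilon)\beta(\beta+\epsilon)}\right]$$

$$=\frac{12\,\epsilon^2}{(\epsilon-2)^2(\epsilon-3)}\cdot\frac{1}{\alpha(\alpha+\epsilon)\beta(\beta+\epsilon)}\left[\alpha(\alpha+\epsilon)\beta(\beta+\epsilon)+(\epsilon-1)\{\alpha(\alpha+\epsilon)+\beta(\beta+\epsilon)\}-\frac{\epsilon(\epsilon-1)(\epsilon^2-7\epsilon+4)}{12}\right].$$

The constants $\alpha$, $\beta$ and $\gamma$ have been taken positive, hence the point $(\beta_1, \beta_2)$ lies
above, below, or on the Type III line $2\beta_2 - 3\beta_1 - 6 = 0$, according as the expression

$$X=\alpha(\alpha+\epsilon)\beta(\beta+\epsilon)+(\epsilon-1)\{\alpha(\alpha+\epsilon)+\beta(\beta+\epsilon)\}-\frac{\epsilon(\epsilon-1)(\epsilon^2-7\epsilon+4)}{12}$$

is negative, positive or zero.

The condition for a Type III curve is clearly $X = 0$. The algebra may be greatly
simplified by writing

$$\rho=\frac{1}{\alpha(\alpha+\epsilon)\,\beta(\beta+\epsilon)},\qquad \sigma=\frac{1}{\alpha(\alpha+\epsilon)}+\frac{1}{\beta(\beta+\epsilon)}.$$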





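The betas then take up the comparatively simple forms

$$\beta_1=\frac{\epsilon-1}{(\epsilon-2)^2}\left[16+4\epsilon^2\sigma+\epsilon^4\rho\right],$$

$$\beta_2=\frac{3(\epsilon-1)(\epsilon+6)}{(\epsilon-2)(\epsilon-3)}+\frac{\epsilon^2(\epsilon-1)}{(\epsilon-2)(\epsilon-3)}\{\epsilon(\epsilon+1)\rho+6\sigma\},$$

and we find

$$2\beta_2-3\beta_1-6=\frac{\epsilon^2}{(\epsilon-2)^2(\epsilon-3)}\left[12+12(\epsilon-1)\sigma-\epsilon(\epsilon-1)(\epsilon^2-7\epsilon+4)\rho\right],$$

$$4\beta_2-3\beta_1=\frac{\epsilon^2(\epsilon-1)}{(\epsilon-2)^2(\epsilon-3)}\left[12+12(\epsilon-1)\sigma+\epsilon(\epsilon^2+5\epsilon-8)\rho\right],$$

$$\beta_2+3=\frac{\epsilon^2}{(\epsilon-2)(\epsilon-3)}\left[6+6(\epsilon-1)\sigma+\epsilon(\epsilon^2-1)\rho\right].$$

Hence, substituting these values in the criterion

$$\kappa=\frac{\beta_1(\beta_2+3)^2}{4(2\beta_2-3\beta_1-6)(4\beta_2-3\beta_1)},$$

we find

$$\kappa=\frac{\{16+4\epsilon^2\sigma+\epsilon^4\rho\}\,\{6+6(\epsilon-1)\sigma+\epsilon(\epsilon^2-1)\rho\}^2}{4\,\{12+12(\epsilon-1)\sigma-\epsilon(\epsilon-1)(\epsilon^2-7\epsilon+4)\rho\}\times\{12+12(\epsilon-1)\sigma+\epsilon(\epsilon^2+5\epsilon-8)\rho\}}.$$

$\alpha$, $\beta$, $\gamma$ and $\epsilon$ are, by hypothesis, positive quantities. Hence, $\sigma$, $\rho$, $\sigma^2$ and $\sigma\rho$ are
positive quantities. The expression $\epsilon^2 - 7\epsilon + 4$ is a minimum when $\epsilon = \tfrac{7}{2}$. Its value
is then $-8\tfrac{1}{4} > -9$. Hence

$$\kappa>\frac{\{16+4\epsilon^2\sigma+\epsilon^4\rho\}\,\{6+6(\epsilon-1)\sigma+\epsilon(\epsilon^2-1)\rho\}^2}{4\,\{12+12(\epsilon-1)\sigma+9\epsilon(\epsilon-1)\rho\}\times\{12+12(\epsilon-1)\sigma+\epsilon(\epsilon^2+5\epsilon-8)\rho\}}.$$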

Comparing the coefficients of $\sigma$, $\rho$, $\sigma\rho$, $\sigma^2$, $\rho^2$ in the numerator and denominator
of this expression, we find that $\kappa$ is always greater than unity. Hence the two
betas of the hypergeometric series $F(\alpha, \beta, \gamma, 1)$ with positive constants, lie above
the Type V line which divides the Type IV and Type VI regions on the $\beta_1$, $\beta_2$
plane.

By making $\alpha$ and $\beta$ indefinitely large compared with $\epsilon$ (which is equivalent to
making the quantities $\rho$ and $\sigma$ indefinitely small) we have already shown that the
first two betas tend to lie on the Type V line. Therefore, the Type V line forms the
lower bound to the $\beta_1$ and $\beta_2$ area of the above hypergeometric series. This is
significant, because it means that when fitting by the first four moments, the
Type IV curve never arises.
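The behaviour of the criterion may be checked numerically for particular constants. The sketch below is illustrative only: it assumes the forms β₁ = (ε − 1)[16 + 4ε²σ + ε⁴ρ]/(ε − 2)² and β₂ = 3(ε − 1)(ε + 6)/((ε − 2)(ε − 3)) + ε²(ε − 1){ε(ε + 1)ρ + 6σ}/((ε − 2)(ε − 3)), with ρ and σ as defined in terms of α(α + ε) and β(β + ε), and it evaluates κ only for the three sets of constants used in the examples of this section and for one case of very large α and β. The function names are hypothetical.

```python
def betas(alpha, beta, eps):
    """First two (uncorrected) betas from alpha, beta and eps, via rho and sigma."""
    A = alpha * (alpha + eps)
    B = beta * (beta + eps)
    sigma = 1.0 / A + 1.0 / B
    rho = 1.0 / (A * B)
    b1 = (eps - 1) / (eps - 2) ** 2 * (16 + 4 * eps**2 * sigma + eps**4 * rho)
    b2 = (3 * (eps - 1) * (eps + 6)
          + eps**2 * (eps - 1) * (eps * (eps + 1) * rho + 6 * sigma)
          ) / ((eps - 2) * (eps - 3))
    return b1, b2

def kappa(b1, b2):
    """Pearson's criterion for the type of curve."""
    return b1 * (b2 + 3) ** 2 / (4 * (2 * b2 - 3 * b1 - 6) * (4 * b2 - 3 * b1))

# Constants (alpha, beta, eps) of Examples I, II and III of this section.
examples = [(10, 30, 60), (30, 30, 50), (94, 94, 400)]
kappas = [kappa(*betas(a, b, e)) for a, b, e in examples]
# alpha and beta very large compared with eps: the Type V limit.
limit_kappa = kappa(*betas(1e6, 1e6, 10))
print(kappas, limit_kappa)
```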

It now remains to find the upper boundary to the $\beta_1$ and $\beta_2$ area. This will
be found as follows.

The hypergeometric series

$$\left[1+\frac{\alpha\beta}{1!\,\gamma}+\frac{\alpha(\alpha+1)\beta(\beta+1)}{2!\,\gamma(\gamma+1)}+\cdots\right]y_0,$$

where $y_0$ is the inverse of the sum of the series, may be written in the form

$$\left[1+\frac{1}{1!}\,\frac{\alpha\beta}{\gamma}+\frac{1}{2!}\left(\frac{\alpha\beta}{\gamma}\right)^2\frac{(1+1/\alpha)(1+1/\beta)}{1+1/\gamma}+\cdots\right]y_0.$$

Make $\alpha$, $\beta$ and $\gamma$ tend to infinity in such a way that $\alpha\beta/\gamma$ remains constant and
equal to $m$. In the limit the series takes the form

$$e^{-m}\left(1+\frac{m}{1!}+\frac{m^2}{2!}+\frac{m^3}{3!}+\cdots\right),$$

which is the exponential or Poisson series. Hence, the Poisson limit to the binomial
is also a limit to the hypergeometric series.
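The approach to the Poisson limit can be seen numerically. The following sketch is an illustration under stated assumptions, not part of the original argument: it takes α = β = L and γ = L²/m, so that αβ/γ = m exactly, truncates both series at 60 terms (ample for m = 3), and compares the normalised terms. All names are hypothetical.

```python
from math import exp, factorial

def normalised_terms(alpha, beta, gamma, n_terms):
    """First n_terms of F(alpha, beta, gamma, 1), divided by their sum."""
    terms = [1.0]
    for r in range(n_terms - 1):
        ratio = (alpha + r) * (beta + r) / ((r + 1) * (gamma + r))
        terms.append(terms[-1] * ratio)
    total = sum(terms)
    return [t / total for t in terms]

def poisson_terms(m, n_terms):
    """Terms of the Poisson series with mean m."""
    return [exp(-m) * m**r / factorial(r) for r in range(n_terms)]

def max_gap(L, m, n_terms=60):
    """Largest absolute difference between corresponding terms."""
    hyp = normalised_terms(L, L, L * L / m, n_terms)
    poi = poisson_terms(m, n_terms)
    return max(abs(h - p) for h, p in zip(hyp, poi))

gap_small = max_gap(100.0, 3.0)
gap_large = max_gap(10000.0, 3.0)
print(f"L = 100: {gap_small:.2e};  L = 10000: {gap_large:.2e}")
```

The gap shrinks as L grows, in accordance with the limiting argument.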


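Now

$$\epsilon=\gamma-\alpha-\beta-1,$$

therefore

$$\frac{\epsilon}{\alpha\beta}=\frac{\gamma}{\alpha\beta}-\frac{1}{\beta}-\frac{1}{\alpha}-\frac{1}{\alpha\beta}.$$

In the limit, therefore,

$$\frac{\epsilon}{\alpha\beta}=\frac{\gamma}{\alpha\beta}=\frac{1}{m}.$$

Hence we may obtain the same Poisson limit by making $\alpha$, $\beta$ and $\epsilon$ tend to
infinity in such a way that $\alpha\beta/\epsilon$ remains equal to a constant quantity $m$.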


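We have

$$\beta_1=\frac{\epsilon-1}{(\epsilon-2)^2}\cdot\frac{(2\alpha+\epsilon)^2(2\beta+\epsilon)^2}{\alpha\beta(\alpha+\epsilon)(\beta+\epsilon)}$$

and

$$\beta_2=\frac{3(\epsilon-1)(\epsilon+6)}{(\epsilon-2)(\epsilon-3)}+\frac{\epsilon^2(\epsilon-1)}{(\epsilon-2)(\epsilon-3)}\left[\frac{\epsilon(\epsilon+1)}{\alpha(\alpha+\epsilon)\beta(\beta+\epsilon)}+6\left\{\frac{1}{\alpha(\alpha+\epsilon)}+\frac{1}{\beta(\beta+\epsilon)}\right\}\right].$$

Proceeding to the limit, $\beta_1$ and $\beta_2$ take the values

$$\beta_1=\frac{1}{m},\qquad \beta_2=3+\frac{1}{m}.$$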





Eliminating $m$, we arrive at the Poisson line

$$\beta_2-3=\beta_1.$$

Substituting now the general values of $\beta_1$ and $\beta_2$ in $\beta_2 - 3 - \beta_1$, we find

$$\beta_2-\beta_1-3=\frac{2}{(\epsilon-2)^2(\epsilon-3)}\left[(7\epsilon^2-16\epsilon+12)+\epsilon^3(\epsilon-1)\{\sigma+(\epsilon-1)\rho\}\right].$$

Since $\epsilon > 3$, this expression is always positive. Hence the points $(\beta_1, \beta_2)$ never
lie above the Poisson line. They never actually reach this line, which may be
considered as a mathematical limit.

Hence, the limits to the $\beta_1$, $\beta_2$ area for the hypergeometric series in which $\alpha$, $\beta$
and $\gamma$ are positive quantities and $x$ unity, are

$$\beta_2-3=\beta_1\qquad\text{upper limit,}$$

$$\beta_1(\beta_2+3)^2=4(2\beta_2-3\beta_1-6)(4\beta_2-3\beta_1)\qquad\text{lower limit.}$$

These two lines arise from the Gaussian point ($\beta_1 = 0$, $\beta_2 = 3$).

Examples.

I. $\alpha = 10$, $\beta = 30$, $\gamma = 101$,

$\epsilon = \gamma - \alpha - \beta - 1 = 60$, $c = 1$.

Substituting these values in the formulae for the moments and betas, we find

log $N$ = 1.639,8377,
$\nu_2$ = 8.898,305,085,
$\beta_1'$ = .855,217,711,
$\beta_2'$ = 4.351,447,584.

Position of the mode measured from start of histogram is

4.209,677.
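The quoted position of the mode can be reproduced by a simple rule, offered here only as an inference and not as the author's stated procedure: solve for the (fractional) index r* at which the ratio of successive terms, (α + r)(β + r)/((r + 1)(γ + r)), equals unity, take the mode of the index as r* + ½, and add a further ½ because the histogram starts half a unit before the first midordinate. The function name is hypothetical.

```python
def mode_from_start(alpha, beta, gamma):
    """Mode measured from the start of the histogram (unit grouping), assuming
    it lies midway between the two equal terms at index r* and r* + 1."""
    # (alpha + r)(beta + r) = (r + 1)(gamma + r) is linear in r after r^2 cancels.
    r_star = (alpha * beta - gamma) / (gamma - alpha - beta + 1)
    return r_star + 1.0

print(round(mode_from_start(10, 30, 101), 6))   # series of Example I
```

The same rule returns 16.173,077 for α = β = 30, γ = 111 and 21.514,925 for α = β = 94, γ = 589, in agreement with the modes quoted for Examples II and III.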

Mean of series measured from start of histogram is

$$\frac{\alpha\beta}{\epsilon}+.5=5.5.$$

Corrected moments and betas, by Sheppard, are

$\mu_2$ = 8.814,9718,
$\beta_1$ = .879,70242,
$\beta_2$ = 4.377,2277.
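The passage from the crude to the corrected values can be retraced with Sheppard's corrections for unit grouping, μ₂ = ν₂ − c²/12, μ₃ = ν₃, μ₄ = ν₄ − c²ν₂/2 + 7c⁴/240. The sketch below assumes these standard corrections and takes the crude values quoted for Example I as its input; the function name is hypothetical.

```python
from math import sqrt

def sheppard_corrected(nu2, beta1_raw, beta2_raw, c=1.0):
    """Return (mu2, beta1, beta2) after Sheppard's corrections, starting from
    the crude second moment and the crude betas."""
    nu3 = sqrt(beta1_raw * nu2**3)      # crude third moment (magnitude)
    nu4 = beta2_raw * nu2**2            # crude fourth moment
    mu2 = nu2 - c**2 / 12.0
    mu3 = nu3                           # the third moment needs no correction
    mu4 = nu4 - c**2 * nu2 / 2.0 + 7.0 * c**4 / 240.0
    return mu2, mu3**2 / mu2**3, mu4 / mu2**2

# Crude values quoted for Example I.
mu2, beta1, beta2 = sheppard_corrected(8.898305085, 0.855217711, 4.351447584)
print(f"mu2 = {mu2:.7f}, beta1 = {beta1:.8f}, beta2 = {beta2:.7f}")
```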

The point $(\beta_1, \beta_2)$ falls in the Type VI region, and the curve which has the same
first four moments as the above, is

y - y t {x - 164-844,49 1) 4 **-’ 47 


where log y 0 = 301*446,5204. 




For the curve: mode = 170.121,456,

mean = 171.471,584.

Hence, equation of curve referred to mean is

y = y 0 (* + 6-627,093) 4 ' sa > 747 (x + 171*471 ) 584)- 13#1 “- 0M (i). 


Terms of series    Midordinates of (i)    Areas under (i)

  1.000,000           .864,975              .988,594
  2.970,298          2.894,099             2.897,940
  4.965,056          5.023,947             4.985,139
  6.170,167          6.276,603             6.229,540
  6.362,985          6.437,399             6.403,479
  5.769,107          5.792,867             5.776,574
  4.762,233          4.751,426             4.748,863
  3.662,279          3.638,780             3.644,017
  2.666,172          2.643,873             2.652,174
  1.858,982          1.843,221             1.851,678
  1.252,279          1.243,150             1.250,392
   .820,494           .816,128              .821,744
   .525,629           .524,004              .528,083
   .330,620           .330,251              .331,081
   .204,877           .204,941              .206,833
   .125,420           .125,512              .126,744
   .076,022           .076,011              .076,828


II. $\alpha = \beta = 30$,

$\epsilon = 50$, $c = 1$.

Uncorrected moments and betas are

log $N$ = 4.974,5371,
$\nu_2$ = 47.020,4081,
$\beta_1'$ = .540,5824,
$\beta_2'$ = 3.944,4733.

Corrected moments and betas are

$\mu_2$ = 46.937,0748,
$\beta_1$ = .543,46676,
$\beta_2$ = 3.947,83375,

and the Pearson curve with the same first four moments is 
y=y 0 (x- 86*467, 3846) u,408 . 4 *“ 
log y 0 “ 127*806,2761. 

Mode of series measured from start of histogram = 16.173,077.
Mean of series measured from start of histogram = 18.5.



[Figure: Histogram of Series F(10, 30, 101, 1).]

For the curve: mode = 107.002,995,

mean = 109.376,152.

Hence, equation of curve referred to mean as origin is

y - r/o (x + 22'908,767l) 13 ' 408 ' 498 * (x + 1 09-376, 1517)-“™’ l “ 8 . 


Terms of     Midordinates   Areas        Terms of     Midordinates   Areas
series       of (i)         under (i)    series       of (i)         under (i)

   1.000        1.661          2.143     2219.294     2216.193       2217.745
   8.108        9.338         10.122     1881.510     1878.842       1880.440
  34.786       36.842         37.594     1582.046     1579.832       1581.402
 105.074      104.401        107.483     1320.269     1318.493       1319.981
 250.993      246.924        251.384     1094.251     1092.860       1094.233
 504.485      496.498        501.947      901.236      900.190        901.428
 887.922      876.844        882.507      738.007      737.243        738.340
1405.064     1393.196       1398.031      601.165      600.631        601.587
2037.641     2027.573       2030.881      487.341      486.964        487.788
2747.294     2741.426       2742.532      393.330      393.072        393.773
3482.197     3481.828       3480.533      316.176      316.006        316.596
4185.962     4191.145       4187.632      253.220      253.098        253.591
4806.421     4816.147       4810.885      202.116      202.003        202.441
5302.393     5314.851       5308.492      160.831      160.768        161.104
5647.539     5660.936       5654.136      127.619      127.564        127.839
5831.270     5843.824       5837.208      101.007      100.957        101.180
5857.304     5867.936       5861.957       79.758       79.708         79.889
5740.646     5748.560       5743.552       62.846       62.796         62.941
5503.945     5508.994       5505.110       49.426       49.375         49.492
5173.842     5176.205       5173.474       38.804       38.751         38.844
4777.844     4777.864       4776.227       30.417       30.365         30.439
4341.916     4340.233       4339.546       23.809       23.758         23.817
3888.884     3886.114       3886.210       18.613       18.564         18.610
3437.577     3434.306       3434.994       14.535       14.488         14.525
3002.535     2999.010       3000.138
2594.189     2590.791       2592.192





III. $\alpha = \beta = 94$, $c = 1$,

$\gamma = 589$, $\epsilon = 400$.

Raw moments and betas are

log $N$ = 7.807,6359,
$\nu_2$ = 33.776,6619,
$\beta_1'$ = .139,6389,
$\beta_2'$ = 3.210,1843.

Mode of series = 21.514,925,
mean of series = 22.59,

both being measured from the start of the histogram.

The quantity $r = \dfrac{6(\beta_2-\beta_1-1)}{6+3\beta_1-2\beta_2}$ is very large, of the order 6000. We are justified,
therefore, in fitting a Type III curve.






Corrected moments are

$\mu_2$ = 33.693,3286,
$\mu_3$ = 73.354,7625,
whence $\beta_1$ = .140,67776.

The Pearson Type III curve having the same first three moments is

y = y 0 e- >a ' tm »a*' m ’ sm , 

l°g yo =* 22096,8526. 

For the curve: mode = 29.863,491,

mean = 30.952,057.

Equation of curve referred to mean as origin is

y = os* + 30-952,057 ) a7 ‘ 4a8 ' 8134 . 



With the exception of the first few terms we see that the areas under the curve 
accord very closely with the terms of the series when the standard deviation of the 
latter is not small, or what is equivalent, when the number of significant terms is 
not small. (See also pp. 302 — 304.) 









Ib. Infinite Hypergeometric Series in which two of $\alpha$, $\beta$, $\gamma$ are
Negative Non-integral Numbers.

(i) $\alpha$ and $\beta$ negative, $\gamma$ positive.

Let $a$ be the largest integer in $|\alpha|$ and $b$ the largest integer in $|\beta|$, then

$$\alpha=-a-\delta_1,\qquad \beta=-b-\delta_2,\qquad 0<\delta_1,\ \delta_2<1.$$

Assume that $b \geq a$. The $(a + 1)$st term of the series is

$$\frac{\alpha(\alpha+1)(\alpha+2)\cdots(-\delta_1)\;\beta(\beta+1)\cdots(\beta+a)}{(a+1)!\;\gamma(\gamma+1)\cdots(\gamma+a)}.$$

This term is positive. The $(a + 2)$nd term is

$$\frac{\alpha(\alpha+1)\cdots(-\delta_1)(1-\delta_1)\;\beta(\beta+1)\cdots(\beta+a)(\beta+a+1)}{(a+2)!\;\gamma(\gamma+1)\cdots(\gamma+a+1)}.$$

In order that this and all subsequent terms be positive, the following condition
must be satisfied:

$$\beta+a+1>0.$$

Now $\alpha = -a - \delta_1$, $0 < \delta_1 < 1$, and $\beta = -b - \delta_2$, $0 < \delta_2 < 1$. Since $b \geq a$, the
condition can hold only when $b = a$, and then

$$\beta+a+1=(1-\delta_2)>0,\quad\text{i.e. } 1>\delta_2>0.$$

A necessary and sufficient condition for all terms to be positive is, therefore,

$$a=b.$$

Since the series is infinite, the condition $\epsilon > 3$ must be satisfied if we intend
fitting curves by the first four moments.

When $\epsilon > 4$, the first two betas lie above the Type III line. When also

$$-(\epsilon-1)\leq\alpha,\ \beta\leq-1,$$

the first two betas lie above the Poisson line. In either case the corresponding
Pearson curve is of Type I (or the types associated with Type I). (For proof of this
statement, see corresponding section under finite series, pp. 298–9.)

(ii) $\alpha$ and $\gamma$ negative, $\beta$ positive.

As in the previous case, in order that the terms of the series be all positive, the
condition $|\gamma - \alpha| < 1$ must be satisfied.

Now

$$3<\epsilon=\gamma-\alpha-\beta-1<-\beta.$$

Hence, for all expressions to be finite, $(-\beta)$ must be at least greater than 3.
This implies $\beta < -3$, contrary to the hypothesis that $\beta$ is positive. A positive $\epsilon$
would imply a negative $\beta$, hence we cannot have an infinite convergent
hypergeometric series with all terms positive and $\gamma$ negative.

Example.

$\alpha = \beta = -25.6$,
$\gamma = 30$, $c = 1$.

Crude moments and betas are

log $N$ = 5.369,3566,
$\nu_2$ = 3.835,2290,
$\beta_1'$ = .004,688649,
$\beta_2'$ = 2.949,4076.

Mean of series = 8.671,571.

Mode of series = 8.607,786.

Corrected moments and betas are

$\mu_2$ = 3.751,8956,
$\beta_1$ = .005,007951,
$\beta_2$ = 2.947,7272,

and the Pearson curve having the same first four moments is 

y * j/o (16-041,94 + «) 39 '* a0 ' 816 (23-00322 - a;) M ' 4M ' 871 , 
log y 0 = 120-789,4740. 

Origin at mode. Mean − mode of curve = .0714,082.


Terms of series    Midordinates of curve    Areas under curve

       1.0                 1.2                    1.6
      21.8                19.7                   24.5
     213.2               188.3                  216.6
   1,237.1             1,131.3                1,232.5
   4,786.7             4,554.5                4,764.9
  13,136.9            12,886.0               13,122.7
  26,546.5            26,538.9               26,576.1
  40,468.6            40,829.6               40,514.2
  47,299.1            47,808.2               47,300.7
  42,840.2            43,154.0               42,799.3
  30,269.4            30,271.0               30,273.0
  16,741.7            16,561.8               16,750.5
   7,253.4             7,064.9                7,270.7
   2,457.1             2,339.8                2,463.8
     648.0               596.6                  646.6
     132.1               115.6                  129.8
      20.6                16.7                   19.6
       2.4                 1.8                    2.2







Here again the accordance between the terms of the series and the areas under
the curve is very close in the significant parts of the curve. (See also pp. 302–4
and 306.)

II. Finite Hypergeometric Series.

The hypergeometric series

$$1+\frac{\alpha\beta}{1!\,\gamma}+\frac{\alpha(\alpha+1)\beta(\beta+1)}{2!\,\gamma(\gamma+1)}+\cdots$$

is finite only when $\alpha$ or $\beta$ or both are negative integers. In order that the series
may represent a frequency distribution, each term must be positive. Hence, either

(a) $\alpha$ and $\beta$ are negative with one at least an integer, or

(b) $\alpha$ and $\gamma$ negative, $\alpha$ a negative integer and $|\gamma| > |\alpha|$.

These two sets of conditions give rise to two distinct types of series which will
be considered separately.

(a) $\alpha$ and $\beta$ negative.

In order to find the position of the point $(\beta_1, \beta_2)$ for such series, relative to the
Poisson line, substitute the expressions for the betas in $\beta_2 - \beta_1 - 3$. We then
have

$$\beta_2-\beta_1-3=\frac{2}{(\epsilon-2)^2(\epsilon-3)\,\alpha(\alpha+\epsilon)\beta(\beta+\epsilon)}\Big[(7\epsilon^2-16\epsilon+12)\,\alpha(\alpha+\epsilon)\beta(\beta+\epsilon)$$
$$\qquad\qquad+\ \epsilon^3(\epsilon-1)\{\alpha(\alpha+\epsilon)+\beta(\beta+\epsilon)\}+\epsilon^3(\epsilon-1)^2\Big].$$






If $\epsilon > 3$, the coefficient $\dfrac{2}{(\epsilon-2)^2(\epsilon-3)\,\alpha(\alpha+\epsilon)\beta(\beta+\epsilon)}$ is positive. Hence,
the sign of $\beta_2 - \beta_1 - 3$ is the same as that of the expression

$$E=\alpha(\alpha+\epsilon)\beta(\beta+\epsilon)\{7\epsilon^2-16\epsilon+12\}+\epsilon^3(\epsilon-1)\{\alpha(\alpha+\epsilon)+\beta(\beta+\epsilon)\}+\epsilon^3(\epsilon-1)^2.$$

$E$ is negative when $\alpha = \beta = -1$ and also when $\alpha = \beta = -(\epsilon - 1)$ for all
permissible values of $\epsilon$. When either $\alpha$ or $\beta$ is greater than $-1$, $E$ may be positive, in
which case the point $(\beta_1, \beta_2)$ will lie below the Poisson line. We will now show
that for $\epsilon \geq 4$, $E$ is negative when $\alpha$ and $\beta$ lie in the range $(-1, -\epsilon + 1)$. It will
be sufficient to prove that under these conditions $E$ is negative at all its points of
maxima and minima.

Differentiate $E$ with respect to $\alpha$ and $\beta$ respectively.

$$\frac{\partial E}{\partial\alpha}=(2\alpha+\epsilon)\left[\epsilon^3(\epsilon-1)+\beta(\beta+\epsilon)(7\epsilon^2-16\epsilon+12)\right],$$

$$\frac{\partial E}{\partial\beta}=(2\beta+\epsilon)\left[\epsilon^3(\epsilon-1)+\alpha(\alpha+\epsilon)(7\epsilon^2-16\epsilon+12)\right].$$

Clearly, the points of maxima and minima are given by

(i) $\beta(\beta+\epsilon)=\alpha(\alpha+\epsilon)=-\epsilon^3(\epsilon-1)/(7\epsilon^2-16\epsilon+12)$,

(ii) $\alpha=\beta=-\dfrac{\epsilon}{2}$.

At point (i), $E$ has the value

$$\alpha(\alpha+\epsilon)\left[\epsilon^3(\epsilon-1)+\beta(\beta+\epsilon)(7\epsilon^2-16\epsilon+12)\right]+\epsilon^3(\epsilon-1)^2+\beta(\beta+\epsilon)\,\epsilon^3(\epsilon-1)$$
$$=\epsilon^3(\epsilon-1)\left[(\epsilon-1)+\beta(\beta+\epsilon)\right]$$
$$=-\frac{\epsilon^3(\epsilon-1)^2(\epsilon-2)^2(\epsilon-3)}{7\epsilon^2-16\epsilon+12}.$$

This is negative for all values of $\epsilon > 3$.

At point (ii), $E$ has the value $-\dfrac{\epsilon^3}{16}(\epsilon-4)(\epsilon-2)^2$. This is again negative for
$\epsilon > 4$. It follows, therefore, that $E$ is negative when $\epsilon \geq 4$ and $-1 \geq \alpha, \beta \geq -(\epsilon - 1)$.
The corresponding betas then lie above the Poisson line.
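The conclusion can be checked by brute force. The sketch below evaluates E = α(α + ε)β(β + ε)(7ε² − 16ε + 12) + ε³(ε − 1){α(α + ε) + β(β + ε)} + ε³(ε − 1)² on a grid covering −(ε − 1) ≤ α, β ≤ −1; the grid spacing and the trial values of ε are arbitrary choices made for illustration, and a grid check of this kind is of course no substitute for the analytical argument.

```python
def E(alpha, beta, eps):
    """The expression whose sign decides the side of the Poisson line."""
    A = alpha * (alpha + eps)
    B = beta * (beta + eps)
    return (A * B * (7 * eps**2 - 16 * eps + 12)
            + eps**3 * (eps - 1) * (A + B)
            + eps**3 * (eps - 1) ** 2)

def max_E_on_grid(eps, steps=200):
    """Largest value of E over a square grid on [-(eps - 1), -1] x [-(eps - 1), -1]."""
    lo, hi = -(eps - 1.0), -1.0
    grid = [lo + (hi - lo) * i / steps for i in range(steps + 1)]
    return max(E(a, b, eps) for a in grid for b in grid)

for eps in (4.0, 5.0, 7.5, 20.0):
    print(f"eps = {eps:5.1f}   max E on grid = {max_E_on_grid(eps):.6g}")
```

For ε = 4 the maximum is zero, attained at α = β = −2, in agreement with the value found at point (ii); for larger ε it is strictly negative.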


When $\epsilon = 4$, then $(\alpha + \beta) > -5$, in which case the values of $\alpha$ and $\beta$ which
result in a series with the largest number of terms are, respectively, $-2$, $-3$. This
is a series with three terms. Hence, for finite series with a minimum of three terms,
the first two betas lie above the Poisson line.

As previously (p. 300), if $-\alpha$, $-\beta$ and $\gamma$ tend to infinity in such a way that
$\alpha\beta/\gamma$ remains constant and equal to a finite quantity $m$, the finite hypergeometric
series will, in the limit, become a Poisson series.




When $\alpha$ and $\beta$ are fractional, the series is infinite (see p. 308). $\alpha$ and $\beta$ may then
be greater than $-1$ and the corresponding betas may lie below the Poisson line.
However, by substituting in $2\beta_2 - 3\beta_1 - 6$ it can be readily shown that for $\epsilon > 4$ and
for any negative values of $\alpha$ and $\beta$ the first two betas lie above the Type III line.

The series (2) (p. 295) is finite, $\alpha$ and $\beta$ are negative integers. Hence, for $\epsilon = M > 4$,
its first two betas lie above the Poisson line.

(b) Second type of finite series.

$\alpha$ and $\gamma$ negative, $\alpha$ an integer; $\beta$ positive and $|\gamma| > |\alpha|$.

The last condition is introduced to ensure that all terms are positive.

Now $\gamma < \alpha$, $\gamma - \alpha - \beta - 1 < -(\beta + 1)$,

i.e. $\epsilon < -(\beta + 1)$.

$\beta$ is positive, hence $\epsilon$ is negative.

The position of the first two betas of such series relative to the Poisson line is
determined from the sign of the expression

$$\frac{2}{(\epsilon-2)^2(\epsilon-3)\,\alpha(\alpha+\epsilon)\beta(\beta+\epsilon)}\Big[(7\epsilon^2-16\epsilon+12)\,\alpha(\alpha+\epsilon)\beta(\beta+\epsilon)$$
$$\qquad\qquad+\ \epsilon^3(\epsilon-1)\{\alpha(\alpha+\epsilon)+\beta(\beta+\epsilon)\}+\epsilon^3(\epsilon-1)^2\Big].$$

The term outside the square brackets is always positive. It is sufficient,
therefore, to consider the sign of the expression inside the brackets. This becomes, after
substituting for $\alpha$, $\epsilon$, $(\epsilon + \beta)$ etc., their absolute values $-\alpha'$, $-\epsilon'$, $-(\epsilon' - \beta)$ etc.,

$$-(7\epsilon'^2+16\epsilon'+12)\,\alpha'(\alpha'+\epsilon')\,\beta(\epsilon'-\beta)
+\epsilon'^3(\epsilon'+1)\{(\epsilon'+\alpha')\alpha'-(\epsilon'-\beta)\beta\}-\epsilon'^3(\epsilon'+1)^2.$$

This expression may be positive, negative or zero. The first two betas may,
therefore, lie below, above or on the Poisson line.

Rearrange in the form

$$\alpha'(\alpha'+\epsilon')\left[\epsilon'^3(\epsilon'+1)-\beta(\epsilon'-\beta)(7\epsilon'^2+16\epsilon'+12)\right]-\epsilon'^3(\epsilon'+1)\left[(\epsilon'+1)+\beta(\epsilon'-\beta)\right].$$

$\alpha'$ may vary independently of $\epsilon'$. Hence, this expression will be negative for all
values of $\alpha'$ only when

$$\epsilon'^3(\epsilon'+1)-\beta(\epsilon'-\beta)(7\epsilon'^2+16\epsilon'+12)<0,$$

i.e.

$$\beta^2-\beta\epsilon'+\frac{\epsilon'^3(\epsilon'+1)}{7\epsilon'^2+16\epsilon'+12}<0,$$

i.e. when $\beta$ lies between the values

$$\frac{\epsilon'}{2}\left[1\mp(\epsilon'+2)\sqrt{\frac{3}{7\epsilon'^2+16\epsilon'+12}}\,\right].$$

When $\beta$ lies outside these limits, the first two betas of the corresponding series
may lie above the Poisson line.





However, by substituting in $2\beta_2 - 3\beta_1 - 6$, we can show that for all permissible
values of $\beta$, the corresponding point $(\beta_1, \beta_2)$ lies above the Type III line. For the
position of this point relative to the Type III line depends on the sign of the
expression

$$[\alpha(\alpha+\epsilon)+\beta(\beta+\epsilon)](\epsilon-1)+\alpha\beta(\alpha+\epsilon)(\beta+\epsilon)-\frac{\epsilon(\epsilon-1)}{12}(\epsilon^2-7\epsilon+4).$$

Putting in for the constants their absolute values, this expression may be
rewritten in the form

$$-(\epsilon'+1)\{\alpha'(\alpha'+\epsilon')-\beta(\epsilon'-\beta)\}-\alpha'\beta(\epsilon'-\beta)(\alpha'+\epsilon')-\frac{\epsilon'(\epsilon'+1)}{12}(\epsilon'^2+7\epsilon'+4).$$

Rearranging the terms,

$$\beta(\epsilon'-\beta)\left[(\epsilon'+1)-\alpha'(\alpha'+\epsilon')\right]-\alpha'(\alpha'+\epsilon')(\epsilon'+1)-\frac{\epsilon'(\epsilon'+1)}{12}(\epsilon'^2+7\epsilon'+4)\qquad\text{(i).}$$

This can be positive only when $(\epsilon' + 1) > \alpha'(\alpha' + \epsilon')$.

For such values, the maximum is reached when $\beta = \dfrac{\epsilon'}{2}$. Substitute, therefore,
$\beta = \dfrac{\epsilon'}{2}$ in (i). We have

$$-\alpha'(\alpha'+\epsilon')\left[(\epsilon'+1)+\frac{\epsilon'^2}{4}\right]+\frac{(\epsilon'+1)\,\epsilon'^2}{4}-\frac{\epsilon'(\epsilon'+1)(\epsilon'^2+7\epsilon'+4)}{12}.$$

This is evidently negative for all permissible values of $\epsilon'$. Hence, the
corresponding betas always lie above the Type III line. The series referred to in
paragraphs (i) and (ii) (p. 296) and Examples II (p. 315) and IV (p. 318) are
illustrations of finite series with negative $\gamma$.

When $\beta = 1$, the hypergeometric series reduces to

$$1+\frac{\alpha}{\gamma}+\frac{\alpha(\alpha+1)}{\gamma(\gamma+1)}+\cdots$$

This is finite when $\alpha$ is a negative integer. The terms are all positive and
finite when $\gamma$ is also negative and $|\gamma| > |\alpha|$. Substitute for $\alpha$ and $\gamma$ their absolute
values, the series then adopts the form

$$1+\frac{\alpha'}{\gamma'}+\frac{\alpha'(\alpha'-1)}{\gamma'(\gamma'-1)}+\frac{\alpha'(\alpha'-1)(\alpha'-2)}{\gamma'(\gamma'-1)(\gamma'-2)}+\cdots\qquad\text{(iii).}$$

When $\gamma' \to \alpha'$ the distribution represented by the terms of the series tends to
become rectangular. For all other values the distribution is J-shaped.

In order that a hypergeometric series may represent a U-shaped distribution,
the following conditions must be satisfied:

(i) Series finite.

(ii) $\gamma > \alpha\beta$.

(iii) There must exist an antimode, i.e. from some point onwards the terms
of the series must be constantly increasing, i.e.

$$\frac{(\alpha+r)(\beta+r)}{(r+1)(\gamma+r)}>1\qquad\text{for } r>r_0,\quad r_0<|\alpha|.$$



Case (a). $\alpha$ and $\beta$ negative.

$$\frac{(\alpha+r)(\beta+r)}{(r+1)(\gamma+r)}=\frac{(|\alpha|-r)(|\beta|-r)}{(r+1)(\gamma+r)}<\frac{\alpha\beta}{\gamma}.$$

Hence, if initially we have $\gamma > \alpha\beta$, the terms of the series will form a monotone
decreasing sequence and in no case can they represent a U-shaped distribution.

Case (b). $\beta$ positive, $\alpha$ and $\gamma$ negative.

For a U-shaped distribution, we must initially have $\gamma' > \alpha'\beta$. There must also
exist a positive $r$ for which

$$\frac{(\beta+r)(\alpha'-r)}{(r+1)(\gamma'-r)}>1,\qquad r<\alpha'\qquad\text{(iv),}$$

where $\alpha'$ and $\gamma'$ are the absolute values of the constants. (iv) is equivalent to

$$r>\frac{\gamma'-\alpha'\beta}{\alpha'-\beta-\gamma'+1}=r_0.$$

$r_0$ must be less than $\alpha'$, i.e.

$$\gamma'-\alpha'\beta<\alpha'^2-\alpha'\beta-\alpha'\gamma'+\alpha'\quad\text{or}\quad\gamma'(1+\alpha')<\alpha'(1+\alpha')\quad\text{or}\quad\gamma'<\alpha',$$

contrary to the hypothesis that $\gamma'$ is greater than $\alpha'$.

In no case, therefore, can a hypergeometric series with $x = 1$ represent a U-shaped
distribution. Since the upper branch of the biquadratic on the $\beta_1$, $\beta_2$ plane forms
the lower bound to Pearson U-shaped curves, it also forms the upper limit to the
first two betas of hypergeometric series in which the fourth element $x$ is unity.
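The absence of an antimode can also be confirmed by direct enumeration over small integral constants. The following sketch uses exact rational arithmetic; the ranges searched are arbitrary and the function names are hypothetical. A series is said here to have an antimode if the ratio of successive terms, (α + r)(β + r)/((r + 1)(γ + r)), falls below unity and afterwards rises above it.

```python
from fractions import Fraction

def term_ratios(alpha, beta, gamma, n_terms):
    """Exact ratios t(r+1)/t(r) for r = 0, ..., n_terms - 2."""
    return [Fraction((alpha + r) * (beta + r), (r + 1) * (gamma + r))
            for r in range(n_terms - 1)]

def has_antimode(alpha, beta, gamma, n_terms):
    """True if the terms first decrease and later increase (U-shape)."""
    decreasing_seen = False
    for ratio in term_ratios(alpha, beta, gamma, n_terms):
        if ratio < 1:
            decreasing_seen = True
        elif ratio > 1 and decreasing_seen:
            return True
    return False

# Case (a): alpha = -a, beta = -b negative integers, gamma positive.
case_a = any(has_antimode(-a, -b, g, min(a, b) + 1)
             for a in range(1, 9) for b in range(1, 9) for g in range(1, 21))

# Case (b): alpha = -a, gamma = -g with g > a, beta a positive integer.
case_b = any(has_antimode(-a, b, -g, a + 1)
             for a in range(1, 9) for b in range(1, 13)
             for g in range(a + 1, a + 11))

print("antimode found in case (a):", case_a)
print("antimode found in case (b):", case_b)
```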

Examples.

I. $\alpha = -30$, $\beta = -50$,

$\gamma = 100$, $c = 1$.

The variance and betas of the series after applying Sheppard's corrections are

$\mu_2$ = 4.971,8996,
$\beta_1$ = .018,308731,
$\beta_2$ = 2.956,3878.

The sum of the series is given by

log $N$ = 4.740,1506.

Mean and mode of series, measured from the start of the histogram, are
respectively,

8.879,8883; 8.734,8066.

The Pearson-type curve having the same first four moments as the series is 

y = yo®* 7,7 ® 7 '"® (42*504,879 - xY >im > m (i), 

log y 0 m 104-415, 7762. 

Mode of curve = 14.772,980.

Mean of curve = 14.931,387.




Equation of curve referred to the start of the histogram as origin is 
y - y 0 (6*051,499 + xf 1 ™*** (36*453,380 - 


Terms of series    Corresponding          Corresponding
                   midordinates of (i)    areas under (i)

     1.00                1.17                   1.50
    15.00               13.81                  16.29
   105.52              195.89                 105.90
   463.46              434.52                 459.18
  1427.50             1382.39                1419.70
  3283.25             3253.92                3281.78
  5862.95             5887.76                5873.85
  8344.04             8424.94                8357.10
  9640.48             9736.84                9640.82
  9164.40             9231.93                9152.52
  7239.04             7262.68                7241.46
  4786.14             4778.35                4785.49
  2662.56             2641.87                2668.52
  1250.82             1230.47                1256.37
   497.32              482.74                 499.43
   167.52              159.16                 167.28
    47.80               43.90                  46.99
    11.54               10.06                  11.00
     2.35                1.90                   2.12



[Figure: Histogram of Series F(−30, −50, 100, 1), with the fitted curve (i); abscissa x.]

II. $\alpha = -30$; $\beta = 60$,

$\gamma = -81$; $c = 1$.

Corrected moments and betas are:

$\mu_2$ = 9.293,3598,
$\beta_1$ = .001,272,249,
$\beta_2$ = 2.895,9895,
log $N$ = 8.459,5208.

Mean and mode of series measured from the start of the histogram are,
respectively,

16.571,4286; 16.627,2727.

The Pearson-type curve which has the same first four moments as the series is 

y m (45180, 196 - .*)“*», m ( j). 

log y„ = 63-565,6609. 

Mean of curve = 24.103,296.

Mode of curve = 24.161,973.

Equation of curve referred to the start of the histogram as origin is
y - y 0 (7-53 1 ,869 f (37-648,327 - 


Significant terms of    Corresponding          Corresponding areas
series × 10⁻²           midordinates of        under curve × 10⁻²
                        (i) × 10⁻²

        2.5                    2.4                    2.8
       18.0                   16.9                   19.0
         98                     91                     99
        424                    396                    424
      1,511                  1,430                  1,506
      4,560                  4,374                  4,545
     11,869                 11,523                 11,847
     27,026                 26,499                 27,012
     54,388                 53,738                 54,410
     97,496                 96,871                 97,662
    156,573                156,191                156,658
    226,219                226,284                226,259
    294,892                295,519                294,839
    347,416                348,551                347,278
    370,116                371,542                369,980
    356,383                357,754                356,339
    309,670                310,641                309,847
    242,148                242,492                242,294
    169,699                169,420                169,811
    105,979                105,291                106,002
     58,829                 57,739                 58,480
     28,294                 27,647                 28,228
     11,810                 11,405                 11,770
      4,177                  3,983                  4,170
      1,219                  1,152                  1,229
        282                    268                    293
       48.7                   48.2                   55.0
        5.6                    6.4                    7.6




O. L. Davies 




III. $\alpha = -100$, $\beta = -100$, $\epsilon = 200$, $c = 1$.

This series is symmetrical. Its corrected moments and betas are

$\mu_2 = 12.479,4807$,

$\beta_1 = 0$,

$\beta_2 = 2.989,9710$,

$N = .9054,849 \times 10^{59}$.


Significant terms      Corresponding midordinates     Corresponding
of series × 10^-55     of normal curve × 10^-55       areas × 10^-55

    1017.906                 1022.670                   1018.286
     978.380                  982.400                    979.826
     868.746                  871.141                    869.173
     712.564                  713.000                    712.344
     539.799                  538.627                    539.081
     377.591                  376.669                    376.862
     243.821                  241.705                    243.237
     145.287                  143.677                    144.966
      79.856                   78.721                     79.806
      40.467                   39.837                     40.670
      18.896                   18.607                     19.042
       8.126                    8.022                      8.256
       3.215                    3.192                      3.305
       1.170                    1.172                      1.266
        .391                     .397                       .418
        .120                     .124                       .130
        .034                     .036                       .039
        .009                     .009                       .010
        .002                     .002                       .003




Mean = mode = 50.5, measured from the start of the histogram. $\beta_2$ is sufficiently
near 3 to justify our fitting the normal curve, with $\sigma = 3.532,631$.

Since the series is symmetrical, the terms after the mode only will be given.



IV. $\alpha = 1$, $\beta = -60$, $\gamma = -65$, $c = 1$.

The hypergeometric series having the above values for its constants is very
abrupt, the maximum term being the first. It is necessary, therefore, to apply
abruptness corrections to the moments. The corrected moments about the start of
the histogram are

$\mu_1' = 9.063,5495$,

$\mu_2' = 143.738,0331$,

$\mu_3' = 3038.952,2009$,

and $N = 11$.

The best fit is obtained by fixing the start and fitting a Type I curve by equating
the first three moments about the stump. This curve is found to be

$y = y_0\, x^{-.0001,138} (63.349,729 - x)^{4.988,8238}$ ... (i).

This is sufficiently close to the curve

$y = y_0 (63.349,729 - x)^{4.988,8238}$ ... (ii),

$\log y_0 = \overline{9}.028,4019$,

for practical purposes.




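The area up to a point s is given by the following relation:

$N\alpha_s = y_0 \int_0^s (b - x)^{m}\,dx = \dfrac{y_0}{m+1}\left[b^{m+1} - (b - s)^{m+1}\right]$,

where b and m denote the range and the exponent of curve (ii).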
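The area relation is easy to evaluate directly. The sketch below is an illustration only: it assumes b = 63.349,729 and N = 11 from the text, takes the exponent m = 4.988,8238 (inferred from the incomplete beta function quoted later in the paper, so an assumption here), and fixes $y_0$ by requiring the whole area under curve (ii) to equal N. The function name `area_up_to` is hypothetical.

```python
from math import log10

N = 11.0          # total of the series F(1, -60, -65, 1)
b = 63.349729     # range of curve (ii)
m = 4.9888238     # exponent of curve (ii); assumed, see the lead-in

# y0 chosen so that the whole area under the curve equals N.
y0 = N * (m + 1) / b ** (m + 1)

def area_up_to(s):
    """Area under y = y0 * (b - x)**m between x = 0 and x = s."""
    return y0 / (m + 1) * (b ** (m + 1) - (b - s) ** (m + 1))

# Areas over the first few unit cells, to set beside the terms of the series.
areas = [area_up_to(s + 1) - area_up_to(s) for s in range(3)]

print("log10 y0:", round(log10(y0), 5))
for s, a in enumerate(areas):
    print("cell", s, "area", round(a, 6))
```

The printed cell areas can be set beside the column of areas under the curve; small differences in the last figures are to be expected from rounding of the constants.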


Terms of     Midordinates   Areas under    Terms of     Midordinates   Areas under
series       of curve       curve          series       of curve       curve

1.000,000    .999,587       .999,980       .033,688     .033,661       .033,690
 .923,077    .922,722       .922,922       .028,734     .028,709       .028,733
 .850,962    .850,665       .850,865       .024,380     .024,357       .024,381
 .783,426    .783,163       .783,344       .020,571     .020,549       .020,569
 .720,246    .720,021       .720,194       .017,263     .017,233       .017,252
 .661,209    .661,018       .661,183       .014,377     .014,359       .014,376
 .606,108    .605,944       .606,097       .011,898     .011,882       .011,897
 .554,743    .554,603       .554,748       .009,774     .009,759       .009,773
 .506,921    .506,800       .506,939       .007,961     .007,951       .007,962
 .462,454    .462,350       .462,479       .006,432     .006,421       .006,432
 .421,163    .421,074       .421,200       .005,146     .005,136       .005,145
 .382,876    .382,799       .382,914       .004,074     .004,066       .004,074
 .347,424    .347,357       .347,472       .003,188     .003,182       .003,188
 .314,648    .314,589       .314,690       .002,464     .002,457       .002,464
 .284,394    .284,340       .284,442       .001,877     .001,873       .001,877
 .256,512    .256,463       .256,557       .001,408     .001,405       .001,409
 .230,861    .230,816       .230,900       .001,037     .001,035       .001,038
 .207,303    .207,262       .207,346       .000,749     .000,748       .000,751
 .185,709    .185,670       .185,745       .000,529     .000,528       .000,531
 .165,953    .165,915       .165,987       .000,364     .000,363       .000,365
 .147,915    .147,878       .147,944       .000,243     .000,243       .000,244
 .131,480    .131,444       .131,507       .000,156     .000,156       .000,157
 .116,539    .116,504       .116,562       .000,096     .000,096       .000,097
 .102,988    .102,954       .103,007       .000,056     .000,057       .000,057
 .090,727    .090,694       .090,745       .000,031     .000,031       .000,031
 .079,663    .079,631       .079,675       .000,015     .000,016       .000,016
 .069,706    .069,673       .069,716       .000,007     .000,007       .000,007
 .060,769    .060,738       .060,778       .000,003     .000,003       .000,003
 .052,773    .052,743       .052,778       .000,001     .000,001       .000,001
 .045,641    .045,612       .045,646       .000,000     .000,000       .000,000
 .039,302    .039,275       .039,304


With the possible exception of the first example, in which the number of
significant terms is small, the fit of the Pearson curves to the above
series is quite close. The fit improves rapidly as the number of significant terms
of the series increases, and when this number is fairly large, or, what is equivalent,
when the standard deviation of the series is fairly large, the agreement between
the terms of the series and the corresponding areas under the Pearson curve fitted
to it by moments is sufficiently close to justify our replacing the Probability
Integral of the series by that of the Pearson curve.

In the first example (p. 314), where the fit is not very close, the standard
deviation is small, being approximately 2. In the following two examples the
standard deviations of the series concerned lie between 3 and 3.5 and the
corresponding fit has improved appreciably, giving an accuracy of three to four figures
in the areas. This in itself is sufficiently close for most statistical purposes. In
the last example, the standard deviation is larger, being approximately 7.9. The
resulting fit, with the exception of the extreme tail, is surprisingly close.

The number of significant terms of the hypergeometric series is approximately
equal to six times the standard deviation. For series with fewer than 24 significant
terms, corresponding to S.D.'s less than 4, one would not in actual practice go
to the labour of fitting a Pearson curve in order to determine an approximation
to the sum of a number of terms, because this sum can be obtained accurately and
quite readily by calculating the terms directly. It is doubtful, however, whether
such a procedure would be adopted if the number of terms is greater than 30. The
total sum of the series is, of course, known. Hence, the probable limit where one
would calculate the terms directly is a series with 60 significant terms,
corresponding to a standard deviation of about 10. For such series, the appropriate Pearson
curve should give at least as good a fit as in the previous example (Ex. IV). For
instance, the Pearson curve fitted by moments to the series F(1, −60, −65, 1)
is (p. 318)

$y = y_0\, x^{-.0001,138} (63.349,729 - x)^{4.988,8238}$ ... (i).


The ratio of the sum of the remainder of the series after the ninth term to the
total sum of the series is .399,3917.

The approximate value of this ratio found from the curve (i) is

$\int_{9}^{63.349,729} y\,dx \Big/ \int_{0}^{63.349,729} y\,dx$.

This is the incomplete beta function

$I_{.857,9316}(5.988,8238;\ .999,8862)$.

By triple interpolation into the beta function tables, using third differences, this
was found to be .399,3800.

This differs from the true value by less than unity in the fifth place.
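The true ratio can be reproduced by summing the terms exactly. The following sketch assumes the term ratio $t_{r+1}/t_r = (60-r)/(65-r)$ implied by F(1, −60, −65, 1). For comparison it also evaluates the cruder approximation from the simplified curve (ii), $\{(b-9)/b\}^{m+1}$, which is not the incomplete beta value quoted above; the exponent m is an assumed reading.

```python
from fractions import Fraction

# Terms of F(1, -60, -65, 1): t_0 = 1 and t_{r+1} / t_r = (60 - r) / (65 - r).
terms = [Fraction(1)]
for r in range(60):
    terms.append(terms[-1] * Fraction(60 - r, 65 - r))

total = sum(terms)                       # total sum of the series
exact_ratio = sum(terms[9:]) / total     # remainder after the ninth term

# Approximation from curve (ii), y = y0 * (b - x)**m, with the lower
# limit x = 9 measured from the start of the histogram.
b, m = 63.349729, 4.9888238              # m is assumed, see the lead-in
curve_ratio = ((b - 9) / b) ** (m + 1)

print("total sum:", total)
print("exact ratio:", round(float(exact_ratio), 7))
print("curve (ii) ratio:", round(curve_ratio, 5))
```

Curve (ii) alone already gives the ratio to about four figures; the factor $x^{-.0001,138}$ in curve (i) accounts for most of the remaining difference.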


In order to appreciate what happens when the number of significant terms is
large, the goodness of fit of a Pearson curve to the series F(−10,000; −10,000; 1; 1)
is examined within a short range after the mode.

Sum of series $N = .224,5596 \times 10^{6019}$.

F is symmetrical and $\beta_2$ is approximately equal to 3.

$\mu_2 = 1249.979,170$, whence $\sigma = 35.355,0445$.

In the following table the terms of the series after the mode are compared with
the areas under the normal curve

$y = \dfrac{N}{\sqrt{2\pi}\,\sigma}\, e^{-x^2/2\sigma^2}$,

which has the same mean, standard deviation and sum as the above series.
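The areas under this normal curve over unit cells can be evaluated with the error function. The sketch below is illustrative only: it takes N and σ as read from the text, expresses N in units of $10^{6017}$ so that ordinary floating point suffices, and uses a hypothetical helper `normal_area`.

```python
from math import erf, sqrt

N_scaled = 22.45596     # N = .224,5596 x 10^6019, in units of 10^6017
sigma = 35.3550445      # square root of mu_2 = 1249.979,170

def normal_area(k):
    """Area under the normal curve between k - 1/2 and k + 1/2,
    with the origin at the mode, in units of 10^6017."""
    lo = (k - 0.5) / (sigma * sqrt(2))
    hi = (k + 0.5) / (sigma * sqrt(2))
    return 0.5 * N_scaled * (erf(hi) - erf(lo))

for k in range(4):
    print(k, round(normal_area(k), 6))
```

Working in scaled units avoids overflow; the values are directly comparable with the column of areas under the normal curve.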



The quantities tabulated are equal to $10^{-6017}$ times their actual values.

Terms of series      Areas under normal curve

   .253,393               .253,382
   .253,292               .253,281
   .252,988               .252,977
   .252,483               .252,471
   .251,777               .251,765
   .250,872               .250,861
   .249,771               .249,760
   .248,475               .248,465
   .246,989               .246,978
   .245,316               .245,304
   .243,458               .243,446
   .241,422               .241,410
   .239,211               .239,200
   .236,831               .236,820


The true value of $\beta_2$ is 2.999906, which is slightly less than 3. This accounts
for the constant deficiency of 1 in the fifth place in the areas under the normal
curve. A slightly better fit is given by the Type II curve which has the same first
four moments as the series. Even so, the normal curve gives sufficiently accurate
results.

I am indebted to Professor Pearson for suggesting this problem to me and for 
his advice and criticism throughout the preparation of this paper. 



THE BODY BUILD OF AMERICAN-BORN 
JAPANESE CHILDREN. 

By P. M. SUSKI, M.D. 

The better bodily development of American-born Japanese children over the
children born in Japan, presumably through environmental influences, is a fact
observed by many during the past few years. Ishiwara first called attention, in
Japanese medical literature, to the superior height and weight of the second
generation Japanese in America. A few years ago, I measured American-born young
Japanese ranging in age from 15 to 25. At the request of Dr Ishiwara, two years
ago, I measured a hundred Japanese boys born in America. In the former group
the height and weight only were obtained. In the latter, the stature, sitting height,
iliac spine height, knee joint height and weight were measured. The ages of these boys
were 19 to 22. With the measurements of the latter group, the height and weight
of their parents were compared, and it was found that the children born in
America excelled their parents by about 5% in height and about 6% in weight.
(In several instances, one or both of the parents could not be measured.)

That there is a marked difference in bodily development of the children born
and raised under environments foreign to those under which the parents thrived, is
found to be true by many authors. American-born descendants of immigrants differ
in type from the foreign-born parents, according to Boas (1), who further asserts
that, the longer the sojourn of parents in America before the children are born, the
more intensely the influence is felt. Bakwin and Bakwin (2) demonstrated a striking
difference in the body build of infants of two groups from different social environments.
Gray and Gower (3) found that the height and weight of girls in private schools are better
than the average. Studying the bodily growth of Chinese children in Hawaii and
those in Chekiang and Kiangsu Provinces in China, Appleton (4) observes a marked
influence in the manner of growth and development, such that the growth curves,
in general, are more smooth and regular, and the period of growth retardation comes
later and to a lesser degree in Hawaiian-born Chinese children than in those
born in China. She thinks that the acceleration of growth in the stature of Chinese
boys in Hawaii, which is greater than that of boys in East China, is mainly due to the
more rapid increase in length of lower limbs, in which the height of knee is the more
active factor.

If lands foreign to the parents act upon the growth and development of their
offspring, we may enquire what may be the influence of America (i.e. Southern
California) upon the growth of the children of Japanese parents; this was the
question I had always borne in mind while making measurements of children in
Los Angeles, born of Japanese parents.

The material consists of students of private Japanese schools, which are conducted 
to supplement the public school education of the city of Los Angeles. So, practically, 
all of the children are public school children. The age range is from 6 to 19, although 
the youngest and those above 18 years of age are so few in number, that I have 
omitted them from the tables and charts. The parents of the children are from all 
walks of life, from unskilled labourers to business men, manufacturers or literary and 
professional people. With the exception of about one per cent. of the children, who
were born in Hawaii, those measured are California born.

These are the first of the series of the intended annual physical measurements 
of Japanese children born in America. As yet the number is not large enough to 
show conclusive results, but the chief reasons for the present publication of my 
findings are to show at least the apparent marked difference in stature, weight,
chest circumference, sitting height and leg length which I found, when compared 
with figures available for the Japanese children in Japan, and also to request the 
authorities in this line of work to point out any defects in procedure in my work, 
and to give me suggestions for research work in directions which I may not have 
taken. 

The children will be measured hereafter once a year at the same time of the 
year as far as possible. Thus far, the measurements were limited to the months of 
June and July. But the field will be enlarged so that other lots of children will be 
obtained for other months of the year. That the growth rate is not uniform throughout
the year is pointed out by Orr and Clark (5), Zeiner-Hendriksen (6), Nylin (7) and others.
Hejinian and Hatt (8) made monthly measurements of stem-stature index. Sumner
and Whitacre (9) followed monthly change in weight of Texas school children. Iowa
Child Welfare Research Station (10) reports various measurements made monthly
of young children from 3 to 6 years of age. It is usual for the investigators to take
measurements weekly or monthly in the case of infants or small children. The 
figures obtained through these frequent measurements, if computed individually, 
would certainly reveal the seasonal variation in growth and development of children. 
But the purpose of my present investigation concerns the annual growth rate. I 
have planned the examination of all children only once a year, and at the same 
season of the year for each group of children, so that the seasonal variation in 
growth would not interfere with results. 

The time of measurement was in the afternoon for one school and in the forenoon
for others. Martin (11) recommends about 10 A.M. as the ideal time of measurement.
Nakadate found the maximum diurnal variation in stature of children of 10 to 15
years of age to be 1.60 cm. for boys, and 1.52 cm. for girls. He finds the maximum
in stature to be immediately after rising in the morning, and it drops to the lower
level around 11.30 A.M.; from then on till 8 P.M. there is a very small amount of
change (12).





The nearest birthday system is employed in classifying the children into 
different age groups; i.e., the children from 6 years 6 months to 7 years 6 months 
of age are grouped as 7 years, and so on. When making comparison with figures 
obtained from Japan, it was necessary to make allowance of a half year, because 
of the fact that many Japanese investigators classify a child between the 6th and 
7th birthday as 7 years old, and so on. 
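The two age conventions can be stated compactly. The sketch below is illustrative only; the function names are hypothetical, and it assumes that a child exactly on a half-year boundary goes into the older group, a case the text does not settle.

```python
def nearest_birthday_group(years, months):
    """Age group under the nearest-birthday system: 6 years 6 months
    up to (but not including) 7 years 6 months is grouped as 7 years."""
    total_months = 12 * years + months
    return (total_months + 6) // 12

def next_birthday_group(years, months):
    """Age group under the convention described for many Japanese
    investigators: a child between the 6th and 7th birthday is 7 years old."""
    return years + 1

# A child aged 6 years 9 months falls in group 7 under both systems,
# but the groups are centred half a year apart.
print(nearest_birthday_group(6, 9), next_birthday_group(6, 9))
```

Under the first system the group labelled 7 is centred on 7.0 years, under the second on 6.5 years, which is the half-year allowance mentioned above.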

Instruments used are: Martin’s anthropometric set, consisting of a measuring rod 
with arms, callipers, steel tape measure and a slide compass; a box with a square 
flat top to measure sitting height; a standard spring scale. 

I have examined the lists of measurements of Martin (11), Hrdlicka (13), the Geneva
Agreement as cited by Hrdlicka (13), Yoshida(ia), and Lucas and Pryor(as), and found
them to contain 30 to 70 direct measurements of the living body, exclusive of functional
and capacity tests. As Hrdlicka points out, the number of practicable measurements
on the human body is infinite, and these measurements may be of value if taken by
the same method on a sufficiently large number of individuals of various groups.
Bearing these things in mind, I made a selection of physical measurements, chiefly
through the suggestions of Ishiwara, namely, ten measurements, nine direct and one
indirect. They were: 1. Stature, 2. Sitting height, 3. Height of anterior superior
spine of ilium, 4. Leg length, 5. Knee joint height, 6. Arm span, 7. Chest
circumference, 8. Intercristal width, 9. Acromial width, 10. Body weight.

The stature is measured with a child standing on a flat floor without shoes, and 
the head erect, keeping the eye-ear line horizontal. The left anterior superior spine 
of ilium is first determined by palpation, marked with a dermatograph pencil and 
then measured. Downes (14) reports that over 50% of cases show an asymmetry in the
height of the iliac spine exceeding 0.5 cm. I found only a few cases of asymmetry, and
these in a slight degree. 

For the knee joint height, a groove between femur and tibia is easily found 
laterally on the level between the patellar prominence and the tibial tuberosity. 
By inspection from the left side of the left knee joint, one may easily locate the 
position of the groove, and verify it by palpation and mark it with a pencil. 
The distance from the floor is then determined. Martin recommends the internal 
groove of the knee joint, and it is most subcutaneous there and therefore easiest to 
measure. But I did not meet any difficulty in locating the groove on the lateral side. 

The leg length has been computed from the figures of the anterior superior spine 
height according to Martin’s formula, and is intended to measure the height 
of the head of femur from the floor, corresponding to Martin's "ganze Beinlänge"
(the entire leg length). 

The sitting height is obtained by seating a child on a box not too high so that 
the child can rest the feet on the floor, and measuring the height of the vertex above 
the seat level. Gray and Root (16) call our attention to the fact that some workers
call this and other measurements by various names, such as stem-length, trunk-length,
rump-length, etc., with analogous terms in other languages, so that it may mean
lengths from vertex, shoulder, suprasternal notch, 7th cervical vertebra or acromion
down to os pubis, first or last coccyx, perineum, ischial tuberosities or gluteal lines.
It is therefore necessary for one to be very careful when referring to any old 
literature, and it is also very important to make a clear statement as to exactly how 
the sitting height is obtained. The method I employed is in accord with that of 
the Geneva Agreement of 1912, used by Hrdlicka, Martin, v. Pirquet, Yoshida, 
Ishiwara and others. Dreyer(is) urges bending of the knee and pulling up slightly, 
instead of square sitting with both knee and hip joints at right angles. I tried 
Dreyer's method in a few of the children after measuring them in the usual way, 
but I failed to find the 3% difference as pointed out by Dreyer. In the first place, 
it was difficult to keep the sacrum in contact with the post, if the knees were drawn 
up as in Dreyer’s illustration, besides in nearly all of my cases (which were young 
children) the ischial tuberosities were quite subcutaneous with no appreciable 
amount of adipose tissue intervening. 

The arm span is measured with a steel tape measure, from the tip of one middle 
finger to the other, the arms being stretched to both sides of the body horizontally. 
I found it was the best way to let a child stand against the wall to take this 
measurement. 

The chest circumference is taken at the middle of the normal inspiration and 
expiration, on the level above nipples in front and below the scapular angle on the 
back. The bicristal width is obtained with a pair of callipers, the tips of which are
firmly pressed on the widest part of the pelvic crest. The acromial width is measured
in a similar manner, but here respiration, shrugging of the shoulders, or throwing
the entire shoulder backward or forward will greatly change the results. I
exercised great care in taking the diameter at the middle of respiration and keeping
the shoulders at the proper position.

The body weight is measured nude, allowing only a garment weighing 200 gm., 
and after emptying the bladder. It was recorded in the nearest { pound, and later 
converted into kilograms. 

The arithmetic mean, for each age and sex group, of all measurements is given in
Table I, with the standard deviation and probable error of each, computed
according to the formulae (17):

$\text{p.e.} = \pm\, 0.6745\, \dfrac{\sigma}{\sqrt{N}}$.

Table II gives the measures relative to the total height or stature, often called
the index to stature, or ratio. The relative measures are computed as follows:

$\text{Index} = \dfrac{\text{measure other than stature} \times 100}{\text{stature}}$.
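The quantities reported in Tables I and II can be computed as in the following sketch. It is illustrative only: the formula for σ is not legible here, so the root-mean-square deviation with divisor N is assumed, the function names are hypothetical, and the sample statures are invented for the example rather than taken from the tables.

```python
from math import sqrt

def summarize(values):
    """Return (mean, standard deviation, probable error of the mean)."""
    n = len(values)
    mean = sum(values) / n
    # Assumption: divisor N rather than N - 1 for the standard deviation.
    sigma = sqrt(sum((v - mean) ** 2 for v in values) / n)
    probable_error = 0.6745 * sigma / sqrt(n)
    return mean, sigma, probable_error

def index_to_stature(measure, stature):
    """Relative measure: (measure other than stature x 100) / stature."""
    return 100.0 * measure / stature

# Hypothetical statures in cm. for one age group (not data from the paper).
statures = [120.0, 122.0, 124.0, 126.0, 128.0]
mean, sigma, pe = summarize(statures)
print(round(mean, 2), round(sigma, 3), round(pe, 3))
print(index_to_stature(66.0, 120.0))
```

The probable error is about two thirds of the standard error of the mean, so the mean ± p.e. covers the true value with probability one half under the normal law.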



TABLE I.

Measurements of American-born Japanese Children.

[The tabulated means, standard deviations and probable errors are illegible in this copy.]




TABLE II.

American-born Japanese Children. Measurements relative to Stature.

[Legible column headings: Ant. Sup. Spine; Leg Length; Knee Joint; Arm Span; Chest Circ.; Crist. Width; Acrom. Width. The tabulated values are missing from this copy.]



Those marked with an asterisk are instances in which girls excel boys in height. 


























are obtained by combining the United States Government Table of Heights and
Weights and Holt's Age-Height-Weight Table (18). The German figures are from the


TABLE IV.

Body Weight by different Authors compared.

              Boys                                    Girls
      Holt's and                American-     Holt's and                American-
      U.S. Gov.    Camerer-     born          U.S. Gov.    Camerer-     born
      figures      v. Pirquet   Japanese      figures      v. Pirquet   Japanese
Age   combined                                combined
      kg.          kg.          kg.           kg.          kg.          kg.

 7    22.3         23.0         22.3          21.7         21.0         21.1
 8    24.6         25.0         23.9          23.9         23.0         22.6
 9    27.1         27.5         26.5          26.0         25.0         24.6
10    29.9         30.0         29.7          28.7         27.0         28.2
11    32.4         32.5         32.8          31.6         29.0
12    35.4         35.0         34.8          36.2*        32.0         38.1*
13    39.3         37.5         39.5          40.8*        37.0         39.8*
14    44.0         41.0         46.3          45.1*        43.0*        43.1
15    49.4         45.0         49.9          48.6         48.0*        43.4
16    55.5         -            53.0          51.1         52.0         45.7

* Those marked with an asterisk are instances in which girls excel boys in weight.


Fig. 1. Standing Height of Americans. (U.S. Government figures and Holt's combined.)

Fig. 2. Standing Height of European Children. (Camerer-v. Pirquet figures.)

Fig. 3. Standing Height of Japanese. (Education Ministry figures.)

Fig. 4. Standing Height of American-born Japanese.

Fig. 5. Standing Height of American-born Japanese and Americans (U.S. Government figures) compared. (Abscissa: boys' age.)

Fig. 6. Standing Height of American-born Japanese and Americans (U.S. Government figures) compared. (Abscissa: girls' age.)

Fig. 7. Body Weight of Americans. (Holt's and U.S. Government figures combined.)

Fig. 8. Body Weight of European Children. (Camerer-v. Pirquet figures.)

Fig. 9. Body Weight of Japanese Children. (Education Ministry figures.)

Fig. 10. Body Weight of American-born Japanese.


Age-Length-Weight Table by Camerer and v. Pirquet (19). The Japanese figures are
from the table issued by the Japanese Ministry of Education (20). As seen in Figs. 5
and 6, the stature of American-born Japanese boys is very close to that of American
boys at least up to 16 years of age, while there exists a wide difference between
the former and that of Japan-born Japanese (see Fig. 11). The stature of
American-born Japanese girls is nearly the same as that of American girls up to the age
of 12, after which it drops off a little, but when compared with the figures of
Japan-born Japanese girls, there is quite a space above the latter. The American-born
Japanese also excel their cousins in Japan in body weight and chest circumference
(see Table V and Figs. 13-16).

Fig. 11. Standing Height of American-born Japanese Boys compared with Japan-born Japanese Boys.

Fig. 12. Standing Height of American-born Japanese Girls compared with Japan-born Japanese Girls.

It is a well-known fact that children from infancy to 11 or 12 years of age
grow up without showing a marked difference with respect to sex. Between 11 and
14, corresponding to the pre-puberty period, girls excel boys in stature and weight,
after which boys attain superior weight and height. Our tables and figures show
these changes in all groups. But the relationship between girls' curves and boys'
curves, which is of slightly different character in different groups, is best studied
on the special diagram I have prepared for the purpose. First, I took the boys'
height or weight at each age as a standard and figured out a percentage for girls
at that age. Thus, for the stature, I computed

$\dfrac{\text{Girls' Mean Stature at given age} \times 100}{\text{Boys' Mean Stature at the same age}}$,

and called it "the percentage of girls' stature on that of boys." In the same manner,
the body weights of boys and girls are compared (Tables VI and VII, Figs. 17 and 18).
The boys' figures are represented by a straight line at the 100% level, and the girls'
curve runs below it while the boys' figures are greater, but when the girls' figures
surpass those of boys, the curve runs above the boys' line. When I first drew
this type of diagram, I showed it to Drs Lucas and Pryor, who thought it was
unique and excellent in indicating the change in relationship of boys' and girls'
figures so clearly. Later, however, I found Collins and Clark (21) had already devised
a somewhat similar diagram in 1929*. I have also figured out the percentage of
girls' other relative values on those of boys, namely, the relative sitting height,
relative iliac spine height, relative leg length, relative knee joint height, relative
arm span, relative chest circumference, relative bicristal width, relative acromial
width. Among these, I could only make comparison with values of Japanese in
relative sitting height, relative leg length, and relative chest circumference, apart
from stature and weight (see Tables VI, VII and VIII, also Figs. 17, 18 and Figs.
19a-26b).

* [This seems merely a diagram of the customary sex-ratio multiplied by 100; the sex-ratio is very
familiar to anthropometricians and craniometricians; Ed.]

Fig. 13. Body Weight of American-born Japanese Boys compared with Japan-born Japanese Boys.

Fig. 14. Body Weight of American-born Japanese Girls compared with Japan-born Japanese Girls.

Fig. 15. Chest Circumference of American-born Japanese Boys compared with Japan-born Japanese Boys.

Fig. 16. Chest Circumference of American-born Japanese Girls compared with Japan-born Japanese Girls.






TABLE VI.

The Percentage of Girls' Standing Height to that of Boys.

       Americans           Germans        American-born    Japanese
       (U.S. Gov. and      (Camerer-      Japanese         (Education
Age    Holt's Figures      v. Pirquet)                     Ministry)
       combined)

 7         99.5               98.3           100.1            98.6
 8         99.6               98.3            98.4            98.6
 9         99.2               98.4            98.3            98.8
10         99.2               98.5            99.1            99.0
11         99.9               98.5            99.6           100.0
12        101.9               99.3           102.5           100.9
13        101.4              100.3            99.1           100.2
14         99.4              101.3            96.0            98.3
15         97.7              100.6            93.2            96.7
16         94.4                -              94.0            94.2


TABLE VII.

The Percentage of Girls' Weight to that of Boys.

       Americans           Germans        American-born    Japanese
       (U.S. Gov. and      (Camerer-      Japanese         (Education
Age    Holt's Figures      v. Pirquet)                     Ministry)
       combined)

 7         97.1               91.3            94.6            96.9
 8         98.3               92.0            94.6            96.2
 9         96.2               90.9            92.8            96.1
10         96.0               90.0            95.8            97.2
11         97.4               89.2            92.1            99.6
12        102.3               91.4           109.6           103.0
13        104.0               98.7           100.8           103.5
14        102.4              104.9            93.1           100.8
15         98.4              106.7            87.0            90.2
16         91.9                -              86.0            93.4
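The percentages in Table VII follow from the mean weights in Table IV. As a hedged illustration, the sketch below recomputes three entries of the American-born Japanese column from the rounded means printed in Table IV; the helper name `girls_percentage` is hypothetical, and small discrepancies in the last figure are to be expected because the printed means are themselves rounded.

```python
# Mean weights in kg. for American-born Japanese, ages 7, 12 and 13 (Table IV).
boys_weight = {7: 22.3, 12: 34.8, 13: 39.5}
girls_weight = {7: 21.1, 12: 38.1, 13: 39.8}

def girls_percentage(girls, boys):
    """Girls' mean as a percentage of the boys' mean at the same age."""
    return round(100.0 * girls / boys, 1)

percentages = {
    age: girls_percentage(girls_weight[age], boys_weight[age])
    for age in boys_weight
}

for age in sorted(percentages):
    print(age, percentages[age])
```

A value above 100 marks an age at which the girls' mean exceeds the boys', which is how the asterisked entries of Table IV show up on the percentage diagram.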




This diagram shows percentage values of girls' standing
height, taking the boys' standing height as the standard or
100% for each age. The figures published by the Ministry of
Education of Japan are based upon age groups taken from one
birthday to the next. For this reason, the points are marked
on the diagram half-way between ages.


Fig. 18. The Percentage of Girls' Body Weight to that of Boys. [Curves: German (Camerer-v. Pirquet); American (U.S. Government-Holt's); Japanese (Education Ministry); Japanese (American-born). Abscissa: age, 7 to 17.]


TABLE VIII. Comparison of Relative Sitting Heights of
Japan-born and American-born Japanese Children.

        Japanese in Japan
        (Figures of Drs Tsurumi,         American-born
        Nakadate, Yagi and               Japanese
        Toyoda combined)
Age     Boys        Girls                Boys        Girls

 7      57.5        57.6*                55.7        55.3
 8      57.2        57.2                 54.5        54.9*
 9      56.4        56.5*                54.0        54.2*
10      56.0        56.4*                53.6        53.5
11      55.4        55.9*                53.2        53.5*
12      54.8        55.6*                53.1        53.3*
13      54.0        55.1*                52.7        53.6*
14      53.6        55.3*                52.6        54.3*
15      53.8        55.4*                52.0        54.2*
16      54.1        56.6*                53.0        54.9*
17      54.0        58.0*                53.6        -

* Those marked with an asterisk are instances in which girls excel boys in sitting height.






Fig. 19 a. Relative Sitting Height of American-born Japanese
and Japan-born Japanese compared. [Horizontal axis: age.]

Fig. 19 b. Percentage of Girls' Relative Sitting Height to
that of Boys. [Horizontal axis: age.]

Fig. 20 a. Relative Height of Anterior Superior Iliac Spine,
or Iliac Spine Height / Stature × 100. American-born Japanese.
[Horizontal axis: age, 7 to 17.]

Fig. 20 b. Percentage of Girls' Relative Height of Anterior
Superior Iliac Spine to that of Boys. American-born Japanese.
[Horizontal axis: age.]

Fig. 21 a. Relative Leg Length. [Horizontal axis: age.]

Fig. 21 b. Percentage of Girls' Relative Leg Length
to that of Boys. American-born Japanese and Japan-born Japanese.
(The Japan-born figures are measurements of Drs Mishima and
Minayoshi combined.)






Fig. 22 a. Relative Knee Joint Height. American-born Japanese.
[Horizontal axis: age, 7 to 17.]

Fig. 22 b. Percentage of Girls' Relative Knee Joint Height
to that of Boys. American-born Japanese. [Horizontal axis: age, 7 to 17.]

Fig. 23 a. Relative Arm Span, or Arm Span / Stature × 100.
American-born Japanese. [Horizontal axis: age, 7 to 17.]

Fig. 23 b. Percentage of Girls' Relative Arm Span to that of Boys.
American-born Japanese. [Horizontal axis: age, 7 to 17.]

Fig. 24 a. Relative Chest Circumference. American-born Japanese.
[Horizontal axis: age, 7 to 16.]

Fig. 24 b. Relative Chest Circumference. Japanese.
[Horizontal axis: age, 7 to 16.]

Fig. 24 c. Percentage of Girls' Relative Chest Circumference
to that of Boys. [Horizontal axis: age.]

Fig. 25 a. Relative Bicristal Width, or Bicristal Width / Stature × 100.
[Horizontal axis: age, 9 to 17. Curves: boys; girls.]

Fig. 25 b. Percentage of Girls' Relative Bicristal Width to that of Boys.
American-born Japanese. [Horizontal axis: age.]

Fig. 26 a. Relative Acromial Width, or Acromial Width / Stature × 100.
[Horizontal axis: age.]

Fig. 26 b. Percentage of Girls' Relative Acromial Width to that of Boys.
American-born Japanese. [Horizontal axis: age.]

The curves of the percentages of girls' stature and weight to those of boys
(Tables VI, VII, Figs. 17, 18) run the same general course, whether they be for
Americans, Japanese, American-born Japanese or Europeans. The curves show the
pre-puberty upheaval over the boys' 100 % line between the ages of 11 and 14 years
in stature, and between 12 and 15 years in weight. It is noticeable that the curve
of the American-born Japanese is slightly irregular on account of the smaller
number of observations. Figures for Americans as well as for Japanese are based on
measurements of millions; those of Germans are thought to be equally large; but
those of American-born Japanese are only a little over a thousand. That the curves
of the American-born Japanese girls show a sudden and rapid drop after reaching
a maximum at 12 years may be explained by the generally poor physical condition
of the girls of 13 years or more. After making measurements of the same group
of children a number of times in the future, I may be able to offer a better
explanation. The curves of the German girls' acceleration tend to be delayed for about
2 years. This is probably caused by the delay in puberty on account of higher latitude
and colder climate.

The relative sitting height of American-born Japanese is remarkably low when
compared with that of Japan-born Japanese. It is known that this index is highest
at birth and goes down with age until it reaches the lowest level at a definite age
and then again goes up a little. Bean (22) calls this index a “skeletal index” and
finds it inverse to stature (that is, smaller in taller people than in shorter). He
found Whites and Filipinos to have about the same index and Negroes smaller. The
skeletal index, according to Bean, reaches the minimal level between ages 10 to 16 and
9 to 14 in American boys and girls respectively, and between 8 to 16 and 8 to 12
in German-American boys and girls respectively. The minimal level for American-born
Japanese boys is reached at 15, and the same for Japan-born Japanese boys
at 14, the minimum for girls being reached a year or two earlier in either case.
(Table VIII and Fig. 19 a.)

The percentage of girls' relative sitting height is nearly at the 100 % level at 7 years
but it rises gradually till about 15 years of age, to the level between 103 and 104
in both the American-born and Japan-born Japanese. (Table IX and Fig. 19 b.)

The relative height of anterior superior iliac spine goes up from about 54 at 7
years to around 56 at 10 years of age. From then on there is no marked change;
only the girls' figures drop somewhat after 13 years of age.

The percentage of girls’ relative height of anterior superior iliac spine to that 
of boys is nearly at 100% mark up to 11 years, then it drops off gradually. (See 
Table IX and Fig. 20 b.) 

The relative leg length of American-born Japanese is, in general, greater than
that of Japan-born by 4.0 to 4.5, the general tendency of both being a gradual
climb with age up to 10 or 11 years, after which it stays rather stationary. (Table X
and Fig. 21 a.)

The percentage of girls’ relative leg length to that of boys runs just about the 




TABLE IX. The Percentage of Girls' Relative Measures to those of Boys.

Columns, in order: Age; Sitting Height; Iliac Spine Height; Leg Length;
Knee Joint Height; Arm Span; Chest Circumf.; Cristal Width; Acromial Width.

American-born Japanese Children 

7 


100*72 


m 


98-71 

98*81 

101*14 

8 


99*91 

99*87 


99*99 


104*35 

101*99 

9 

100*35 

98-78 

99-74 

■Kfl 

98*94 

99*23 

102*76 

102*47 

10 

99*78 

99*82 

100-36 

98*88 


99*02 

101*93 

100*55 

11 

100*51 

100*11 




97*66 

102*83 

102*53 

12 


99-66 


100*21 

100*04 

97*28 

105*49 


18 


98*94 

98*89 

96*32 

98*76 

101*59 

106*85 

103*54 

14 

103*13 

98*46 

99*48 


99-03 

100-23 

104*15 

104*15 

15 

104*18 

99*42 

100*38 

97*48 


100*79 

108-57 

104*99 

16 

103*47 

98*23 

97*96 

96*66 

99*72 

98*55 

107*19 

100*16 

Japan -born Japanese Children 

74 

100*16 


99-2 




98*1 

_ 

_ 

b| 

100*10 

— 

99*2 

— 

— 

97*8 

— 

— 

9 $ 

100*30 


100*6 

— 

— 

97*7 

— 

— 

104 

100*70 


100*6 

— 

— 

97*0 

— 

— 

lit 

100-79 

— 

100*4 

— 

— 

96*8 

— 

— 

12$ 

101*21 


99*6 

— , 

— 

97*6 

— 

— 

13$ 

101*92 


99*8 

— 

— 

98*6 

— 

— 

14* 

103*15 

— 

99*8 

— 

— 

101*3 

— 

— 

16i 

102*97 

— 

— 

— 

— 

101*8 

— 

— 

16} 

102*62 

— 

— 

— 

— 

— 

— 

— 


Sitting Height (Japan-born): these are from the figures of Drs Tsurumi, Nakadate,
Yagi and Toyoda combined.

Leg Length (Japan-born): these are from the figures of Drs Mishima and Minayoshi
combined.

Chest Circumference (Japan-born): these are from the figures published by the
Ministry of Education, Japan.








TABLE X. Relative Leg Length of American-born Japanese Children
compared with that of Japanese Children in Japan.

        American-born            Japanese in Japan.
        Japanese                 Measurements of
                                 Drs Mishima and
                                 Minayoshi combined

Age     Boys      Girls          Boys      Girls

 7      52.3      52.7           49.2      48.8
 8      53.4      53.4           49.3      48.9
 9      53.7      53.6           49.5      49.8
10      54.6      54.8           49.6      49.9
11      54.7      55.1           49.7      49.9
12      54.6      54.7                     49.8
13      54.9      54.3           50.0      49.9
14      54.6      54.2                     49.9
15      54.2      54.4            -         -
16      54.3      53.2



























same course in American-born as in Japan-born Japanese. The girls excel boys
between 9½ and 12 years of age. (See Table IX and Fig. 21 b.)

The relative knee joint height of American-born Japanese shows a slight increase
with age. The percentage of girls' relative knee joint height to that of boys has
a downward tendency, being above the boys' level at 7 years and dropping down
to about 96.5 % at 16 years of age. (See Table IX and Fig. 22 b.)

The relative arm span of American-born Japanese climbs up from 98 to 102 in
the course of years 7 to 17. The values for boys and girls are nearly the same.
(See Table IX and Figs. 23 a, b.)



Fig. 27.


The relative chest circumference takes a course first down and then up, the
lowest level being at 10 to 11 years in the case of American-born and at about the
13th year in the case of Japan-born Japanese. There seems to be a greater difference
between boys' and girls' figures in Japan-born Japanese than in American-born
Japanese. (Table IX and Fig. 24 c.)

The relative bicristal width and the relative acromial width of American-born
Japanese are kept almost at the same level all the way through the ages 7 to 17.
The percentage of girls' relative values to those of boys is always above the boys'
line, and the general tendency is a steady climb. (Table IX and Figs. 25 b, 26 b.)

TABLE XI.

American Native White Stock (Third Generation Native-born).

Collins and Clark, U.S. Public Health Reports 44; 18, 1059, 1929.

         Standing Height    Sitting Height     Chest Circumf.     Weight
         (cm.)              (cm.)              (cm.)              (kg.)
Age      Boys     Girls     Boys     Girls     Boys     Girls     Boys     Girls

 7       119.5    118.5     63.8     63.3      59.3     58.1      22.0     22.1
 8       125.1    124.3     66.0     65.5      60.9     59.1      25.1     24.3
 9       130.2    129.4     68.0     67.5      62.5     61.3      27.6     27.1
10       135.1    134.5     69.8     69.5      64.3     63.2      30.4     29.8
11       139.9    140.1*    71.5     71.9*     66.2     65.8      33.3     33.7*
12       144.4    146.0*    73.3     74.4*     68.1     68.2*     36.4     37.8*
13       150.0    152.0     75.5     77.6*     70.7     71.1*     40.5     43.0*
14       155.5    155.5     78.2     79.0*     73.6     73.3      45.1     46.8*
15       158.9    157.0     79.8     80.8*     74.4     73.7      47.3     49.5*

Actual Mean Annual Increment

 7-8     5.6      5.8*      2.2      2.2       1.6      1.3       2.6      2.2
 8-9     5.1      5.1       2.0      2.0       1.6      1.9*      2.5      2.8*
 9-10    4.9      5.1*      1.8      2.0*      1.8      1.9*      2.8      2.7
10-11    4.8      5.6*      1.7      2.4*      1.9      2.6*      2.9      3.9*
11-12    4.5      5.9*      1.8      2.5*      1.9      2.4*      3.1      4.1*
12-13    5.6      6.0*      2.2      3.2*      2.6      2.9*      4.1      5.2*
13-14    5.6      3.5       2.7      2.0       2.9      2.2       4.6      3.8
14-15    3.4      1.5       1.6      1.2       0.8      0.4       2.2      2.7*

Percentage Increase per year

 7-8     4.72     4.87*                        2.84     2.25      11.03     9.98
 8-9     4.08     4.11*              2.97      2.63     3.28*      9.96    11.57*
 9-10    3.76     3.94*     2.65     2.97*     2.85     3.05*     10.05    10.27*
10-11    3.54     4.20*     2.45     3.53*     2.98     4.17*      9.77    13.02*
11-12    3.19     4.19*     2.48     3.61*     2.98     3.67*      9.09    12.17*
12-13    3.92     4.13*     3.11     4.23*     3.65     4.26*     11.51
13-14    3.67     2.28                         4.18     2.98      11.37     8.83

* Those marked with an asterisk are instances in which girls excel boys.






The sitting height and the anterior superior iliac spine height are nearly the same
in some children, but not in all. A child may fall into any one of three cases: the
two measurements are nearly equal, or the iliac spine height is greater, or the sitting
height is greater. Whether these types, if they may be so called, have something
to do with body types of children, is yet to be determined. For the present, I will
just divide the boys and girls of the American-born Japanese into these three
types, viz.:

1. Iliac spine plus type. (Where the iliac spine height is greater than the
sitting height.)

2. Equivalent type. (Where they are equal within 0.5 cm.)

3. Sitting height plus type. (Where the sitting height is greater than the iliac
spine height.)
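
The three-way division above can be sketched in a few lines. This is only an illustration: the function name, the sample measurements, and the choice to treat a difference of exactly 0.5 cm as "equivalent" are assumptions, not part of the original paper.

```python
def body_type(sitting_height_cm, iliac_spine_height_cm, tolerance_cm=0.5):
    """Assign one child to one of the three types listed in the text.

    Assumption: a difference of exactly `tolerance_cm` counts as equivalent.
    """
    difference = iliac_spine_height_cm - sitting_height_cm
    if abs(difference) <= tolerance_cm:
        return "equivalent"
    if difference > 0:
        return "iliac spine plus"
    return "sitting height plus"


# Hypothetical (sitting height, iliac spine height) pairs in cm,
# invented for illustration only.
children = [(70.0, 72.5), (70.0, 70.3), (70.0, 68.0)]
types = [body_type(sitting, iliac) for sitting, iliac in children]
print(types)
```

Counting the resulting labels within each age and sex group would give percentages of the kind plotted in Fig. 27.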


Fig. 28. American Native White Stock. [Panels: Annual Increment in Standing
Height; Actual Annual Increment in Sitting Height (cm.); Actual Annual Increment
in Chest Circumference; Actual Annual Increment in Body Weight.]

TABLE XII. American-born Japanese (Second Generation).

* Those marked with an asterisk are instances in which girls excel boys.
































The percentage of the number of each type at each age and sex is given in
Fig. 27. So far I have not seen reports of this nature by other investigators.
I cannot therefore make any comparison or draw any conclusion. One may see,


Fig. 29. American-born Japanese Boys and Girls. [Panels: Actual Annual Increment
in Standing Height; Actual Annual Increment of Chest Circumference (cm.); one
further panel with its axis in kg.]


however, that the maximum of iliac spine plus type in the boys is reached at ages 13 
to 15, while the maximum of the same type for the girls is reached at ages 10 to 12, 
though less pronounced in degree. The equivalent type seems to have no significance. 








Collins and Clark (21) published valuable data on American native white stock
(third generation native-born children), for which they calculated actual mean
annual increment and annual percentage of increase, in stature, sitting height, chest
circumference and body weight. (See Table XI and Fig. 28.) Girls seem to have
greater actual annual increment than boys during the ages between 10 and 13.
These figures are based on measurements of 28,674 children. My figures on the


Fig. 30. American-born Japanese. [Panels: Actual Annual Increment in Anterior
Superior Spine (cm.); Actual Annual Increment in Knee Joint Height (cm.).]


measurement of a little over 1,000 children may not be worth much, but I publish 
them here (Tables XII and XIII, Figs. 29, 30 and 31), simply to show that at least the 
girls' annual increment is decidedly greater than that of boys in all measurements, 
at the age of 12. 
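
As a rough check on how the increment and percentage rows of Table XI relate to the mean measurements, the sketch below recomputes them for boys' standing height at ages 8 to 10. The function name is hypothetical, and taking the percentage relative to the mean at the younger age is an assumption; it happens to reproduce the tabulated values for these two intervals.

```python
def annual_increments(means_by_age):
    """Return (younger age, older age, increment, percentage increase) rows.

    Assumes the ages are consecutive years and that the percentage is taken
    on the mean at the younger age.
    """
    ages = sorted(means_by_age)
    rows = []
    for younger, older in zip(ages, ages[1:]):
        increment = means_by_age[older] - means_by_age[younger]
        percent = 100.0 * increment / means_by_age[younger]
        rows.append((younger, older, round(increment, 1), round(percent, 2)))
    return rows


# Boys' mean standing height (cm.) at ages 8, 9 and 10, from Table XI.
boys_standing_height = {8: 125.1, 9: 130.2, 10: 135.1}
rows = annual_increments(boys_standing_height)
for row in rows:
    print(row)
# (8, 9, 5.1, 4.08)
# (9, 10, 4.9, 3.76)
```

Not every cell of Table XI agrees with such a recomputation from the rounded means, so the sketch should be read as an illustration of the definitions rather than a reconstruction of Collins and Clark's working.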

In conclusion, I have computed the index of measurements of the American-born
Japanese, taking the figures of the Japan-born Japanese as a standard, thus:

$$\text{Index} = \frac{\text{Measurement of American-born} \times 100}{\text{Measurement of Japan-born}}$$



and found that, in stature, the American-born are higher by 6 to 7 %, although
girls drop to about 3.5 % at the age of 14½. The relative sitting height of American-born
is below that of Japan-born by 4 % up to 11½ years, after which it climbs up to
about −2 %. The relative leg length of American-born is more than 6 % greater
at 7 than that of Japan-born, and rises to 10 % at 11½ years.
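
The index defined above is a simple ratio, and a short sketch may make the arithmetic concrete. The function and variable names are hypothetical. The inputs are the boys' relative leg lengths at ages 7 to 9 from Table X, and the half-year age adjustment described in the heading of Table XIV is deliberately left out, which is an assumption of this sketch.

```python
def index(american_born, japan_born):
    """Index of an American-born measurement, Japan-born figure taken as 100."""
    return 100.0 * american_born / japan_born


# Relative leg length of boys at ages 7, 8 and 9, taken from Table X.
american_born_boys = {7: 52.3, 8: 53.4, 9: 53.7}
japan_born_boys = {7: 49.2, 8: 49.3, 9: 49.5}

indices = {
    age: round(index(american_born_boys[age], japan_born_boys[age]), 2)
    for age in american_born_boys
}
print(indices)
# {7: 106.3, 8: 108.32, 9: 108.48}
```

These three values agree with the first three entries for boys in the relative leg length column of Table XIV.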

Fig. 31. American-born Japanese. [Panels: Actual Annual Increment in Arm Span
(cm., scale 0 to 5); Actual Annual Increase of Bicristal Width (cm.). Curves: boys;
girls.]

The chest circumference of American-born climbs from 1 % plus to 9 % above
that of Japan-born at the age of 14½ in the case of boys, while the girls' figures are
generally higher, i.e., rise from 7 % at 7 years to over 10 % at 12½ years, then drop
to 5 % at 14½ years.

The body weight of American-born ranges from 19 to 26 % above that of
Japan-born in the case of boys. The girls' figures are lower than this by 2 to 3 %,
but the lowest level at 14½ years is still over 9 % above that of Japan-born.
(Table XIV and Figs. 32, 33.)

Taken as a whole, the difference in body build of the American-born Japanese 







TABLE XIV.

Index Values of American-born Japanese, with Values of Japan-born
Japanese Children taken as a Standard.

The figures of the American-born Japanese are increased by ½ annual increment
in order to raise to the age of the Japanese, which is year + ½.

Age         Stature    Rel. Sit.   Rel. Leg   Chest Circ.   Weight

Boys
 7½         106.00      96.87      106.30      101.25       119.69
 8½         106.63      95.28      108.32      100.86       119.43
 9½         106.88      95.75      108.48      104.00       121.64
10½         107.13      95.71      110.08      105.50       124.70
11½         106.74      96.03      110.06      106.42       123.36
12½         107.17      96.90      109.45      108.70       123.59
13½         108.22      97.59      112.35      108.92       126.18
14½         107.54      98.14      109.00      109.50       122.65
Average       7.04      -3.47        9.26        5.64        22.66

Girls
 7½         106.82      96.01      108.00      107.01       118.38
 8½         106.38      95.99      109.20      106.81       116.26
 9½         106.89      95.93      107.63      106.77       118.92
10½         107.20      94.86      109.82      107.24       119.67
11½         107.90      95.71      110.42      109.38       125.28
12½         107.03      96.04      109.84      110.70       125.48
13½         105.20      97.28      108.82      108.44       117.90
14½         103.52      98.19      108.62      104.77       109.34
Average       6.37      -3.75        9.04        7.64        18.90

Average for
both sexes    6.7       -3.6         9.15        6.64        20.8


(the so-called second generation Japanese, as their parents are of Japan birth) from
that of Japan-born children is, in round numbers, 7 % higher in stature, 4 % shorter
in relative sitting height, 9 % greater in relative leg length, 7 % greater in chest
circumference, and 20 % heavier in weight.

Both parents of these children were full-blooded Japanese born in Japan in all 
instances. There were only a few pupils of mixed Japanese and Caucasian parentage 
in the two schools where measurements were made, and they were not included in 
the present investigation. Even here in America, the intermarriage of Japanese 
with other races is very rare, as the Japanese community is fairly large and Eurasian 
children are unusual. 

The children in this investigation are certainly growing faster, that is taller and 
heavier, age for age, in comparison with children in Japan. It seems very likely 
that they will grow up to be taller and heavier adults than native born Japanese, 
inferring from the height and weight attained by these children at 16 or 17 years 
of age, which were already greater than those of adult Japanese. 




A few years ago, I measured a hundred young Japanese of American birth
whose ages were 19 years and upwards, and found them to surpass their fathers in
height by an average of more than 3 inches. When I have measured the children
of the present study annually for a few years, I expect to be able to report more
definitely what their ultimate stature and weight will be.

Fig. 32. [Panels: Percentage of Stature of American-born Japanese over Japan-born;
Percentage of Relative Sitting Height of American-born Japanese below that of
Japan-born; Percentage of Relative Leg Length of American-born Japanese over that
of Japan-born Japanese. Horizontal axis: age. Base line: level of Japan-born
Japanese. Curves: boys; girls.]

Prof. K. Pearson tells me in a personal communication of his experience with 
two tall Japanese, who said their stature was quite usual in the case of the Japanese 
island from which they came. I am not aware of any island in Japan which is 
noted for a higher stature of its inhabitants. As far as I know, the people of Japan 
cannot be divided into racial groups, with the exception of the Ainus in the extreme 
north. The latter are not included in the present study. Parents of the children in 
this work come from all parts of Japan, and they may be regarded as representing 
the pure Japanese race. The Japanese Government measurements were performed 
under the supervision of Dr Yoshida, who follows Martin’s method as I do. The 






Government measured the school children from all over Japan, that is Japan proper, 
where no different races figured, annually for a number of years. Therefore I think 
I am justified in making a comparison between my measurements and those of the 
Japanese Government. 


Fig. 33. [Panels: Percentage of Chest Circumference of American-born Japanese
over that of Japan-born Japanese; Percentage of Body Weight of American-born
Japanese over that of Japan-born Japanese. Horizontal axis: age. Base line (100):
level of Japan-born Japanese. Curves: boys; girls.]

No appreciable difference is shown in stature and weight between social and 
labouring classes in Japan. Of course, the weight and stature must be greater in 
better nourished children of well-to-do families when compared with those of 
paupers. In any case, my figures as well as those of the Japanese Government 
measurements cover all classes. 



LITERATURE CITED.

(1) Boas, Franz. “Changes in the Bodily Form of Descendants of Immigrants.” American
Anthropologist, Vol. XIV, pp. 42-53, 1927.

(2) Bakwin, Harry and Bakwin, R. M. “Body build in Infants.” Journal of Clinical
Investigations, Vol. X, pp. 369-402, June, 1931.

(3) Gray, H. and Gower, C. “Growth Standards of Height and Weight for Girls in Private
Schools.” American Journal for Diseases of Children, Vol. xxxv, pp. 411-413, March, 1928.

(4) Appleton, Vivia B. “Growth of Chinese Children in Hawaii and in China.” American
Journal of Physical Anthropology, Vol. x, p. 237, 1927.

(5) Orr, J. B. and Clark, M. L. “Seasonal Variation in the Growth of School Children.”
Lancet, Aug. 16, 1930.

(6) Zeiner-Hendriksen, K. “Growth of Children in Summer.” Norsk Magazin f. Laegevidenskab,
Vol. lxxxi, p. 262, March, 1920.

(7) Nylin, G. “Periodic Variation in Growth.” Acta medica Scandinavica, Supp. 31, pp. 1-207,
1929.

(8) Hejinian, L. and Hatt, E. “The Stem-length; Recumbent Length Ratio as an Index of
Body Type in Young Children.” American Journal of Physical Anthropology, Vol. xiii,
pp. 287-307, July, Sept. 1929.

(9) Sumner, E. E. and Whitacre, J. “Some Factors affecting accuracy in the collection of
data on the Growth in Weight of School Children.” Journal of Nutrition, Vol. iv, p. 15,
May, 1931.

(10) Iowa Child Welfare Research Station. “Physical Traits of Young Children.” American
Journal for Diseases of Children, Vol. xxxviii, p. 541, Sept. 1929.

(11) Martin, R. Anthropometrie.

(12) Yoshida. Physical Measurements.

(13) Hrdlicka, A. Anthropometry.

(14) Downes, R. M. “The Interrelationship of some Trunk Measurements and their relation to
Stature.” Journal of Anatomy and Physiology, Vol. xlviii, p. 299, 1913-1914.

(15) Gray, H. and Root, H. F. “Stem-length and Trunk-length.” Boston Medical and Surgical
Journal, Vol. clxxxiv, p. 439, 1921.

(16) Dreyer, G. Assessment of Physical Fitness. 1921.

(17) Bovard, J. F. and Cozens, F. W. Tests and Measurements in Physical Education. 1931.

(18) Holt, L. E. The Diseases of Infancy and Childhood.

(19) Lust, F. Diagnostik u. Therapie d. Kinderkrankheiten.

(20) Yoshida. “Standard weight of Japanese School Children.” Japan School Hygiene, Vol. xix,
p. 12, Dec. 1931.

(21) Collins, S. D. and Clark, T. “Physical Measurements of Boys and Girls of native White
race Stock (third generation native born) in the United States.” Public Health Report,
Vol. xliv, p. 1059, May, 1929.

(22) Bean, R. Bennett. “The adult Sitting Height. The Sitting Height in Children.” Anatomical
Record, Vol. xviii, p. 222, 1920-1921.

(23) Lucas, M. P. and Pryor, H. B. “Physical Measurements and Physiologic Processes in
Young Children.” Journal of American Medical Association, Vol. xcvii, p. 1127, 1931.



METHODS OF STATISTICAL ANALYSIS APPROPRIATE 
FOR k SAMPLES OF TWO VARIABLES. 

By E. S. PEARSON, D.Sc. and S. S. WILKS*, Ph.D. 


CONTENTS. 


I. Introduction 353

II. Derivation of the Criteria 356

III. Interpretation of the Criteria 359

IV. The Moment Coefficients and Distributions of the Criteria 364

V. Practical Illustrations 369

VI. Conclusion 376

VII. Appendix 376


I. Introduction. 

(1) The Testing of Statistical Hypotheses. Statistical theory which is not purely
descriptive is largely concerned with the development of tools which will assist
in the determination from observed events of the probable nature of the underlying
cause system that controls them. The measured characteristics of quality
vary from unit to unit, and the statistical technique is required to analyse this
variation and covariation, to break it into parts with which may be associated
assignable causes, to test and compare alternative hypotheses, and to express the
resulting conclusions in terms of measures of probability. It will be found that
some of the most recent generalisation of theory has resulted from an attempt to
provide critical tests of increasingly complex hypotheses. We may trace the development
through a chain of questionings: Is it likely, (a) that this sample has been
drawn from a specified population, P; (b) that these two samples have come from
a common but unspecified population; (c) that these k samples have come from a
common but unspecified population? Again the population P may be (d) completely
specified or, (e) only partly specified, e.g. its mean is given but not its
standard deviation; or when there are a number of samples we may allow the
means in the sampled population to be different and question whether the standard
deviations are the same. Another line of advance is from (f) problems dealing only
with a single variable, to (g) those in which there may be a number of correlated
variables.

Now we may frankly admit that in so far as the technique is to be used in
handling data broken up into small groups, the recent theoretical developments
assume normal variation. But to place the procedure for testing statistical hypotheses
on a firm logical basis under one set of simplified conditions, is in itself an
achievement of some value, and perhaps the most practical line of advance is the
following:

(a) To establish what we may term “normal theory.”

(b) To study in a more systematic way than has been attempted the extent
of departure from normality met with in different fields of application.

(c) To examine how rapidly normal theory tests become inefficient as the
form of variation and covariation departs from the normal, and to determine the
nature of the errors in judgment that will arise if these tests are still used.

* International Research Fellow in Mathematics.

(2) The Analysis of Variance . R. A. Fisher’s methods of Analysis of Variance 
may be regarded from the following viewpoint: 

(a) In any given problem it will generally be possible to specify certain factors 
which may be the cause of part of the variation, while there will be a residual part 
which, in the state of knowledge at the moment, must be regarded as due to un- 
identifiable or chance causes. 

(b) An experiment may be designed to test whether a certain factor is opera- 
tive or not ; for example : 

(i) Do differences in manurial treatment affect the yield of some variety 
of cereal ? 

(ii) Does modification in the production process alter the quality of out- 
put of some manufactured product? 

(c) At the same time, in addition to these factors whose influence is under 
investigation, there may be other assignable causes of variation inevitably present, 
the effect of which would be obscuring were it not eliminated. Thus, for example, 
in the illustrations given above, there might be 

(i) Variation due to changing soil fertility.

(ii) Variation due to differences in the skill of operatives or in the state 
of wear of machines. 

(d) It follows that it is often possible to regard the variation in a character
x as made up of parts due to different assignable causes A, B, C, ..., and of a
residual part which, for the time being, we must attribute to chance causes. This
may be expressed as follows:

$$x_{t,u,v,w,\ldots} = a_u + b_v + c_w + \ldots + X_{t,u,v,w,\ldots} \qquad (1)^{*}$$

where $x_{t,u,v,w,\ldots}$ is the character of the $t$-th individual of a group of
$n_{u,v,w,\ldots}$, all of which receive the same contribution $a_u$ from the A factor,
the same contribution $b_v$ from the B factor and so on. $X_{t,u,v,w,\ldots}$ represents
the residual term. In so far as the causes of variation are assignable, this grouping
is possible.

* [The reader must bear in mind that for (1) to be true the effect of the causes A, B, C, ... on the
character x must be additive. For example, the real effect of A and B might lead to a ratio $a_u/b_v$ in
the expression for $x_{t,u,v,w,\ldots}$, in which case the assumption of an additive relation would involve the
influence of A, B, C, ... appearing in $X_{t,u,v,w,\ldots}$. Ed.]



(e) The technique of analysis consists in arranging the data so as to test
separately for the presence of an A factor, or a B factor, etc. as desired. This is
effected by obtaining in each case two estimates of the unknown variance, $\sigma^2$, of
the residuals X, which would differ only through chance fluctuations if changes in
the particular factor had no influence on variation within the limits covered by the
experiment.
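
For the simplest case of a single factor, the two estimates of variance described in (e) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' own working: the function name and the data are invented, and only one factor (the grouping itself) is considered, so one estimate comes from the spread of the group means and the other from the spread within groups.

```python
def variance_estimates(groups):
    """Return (between-group, within-group) estimates of the residual variance.

    Assumes a single assignable factor, whose levels are the groups.
    If that factor has no influence, the two estimates should differ only
    through chance fluctuations.
    """
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    means = [sum(g) / len(g) for g in groups]

    # Sum of squares of group means about the grand mean, weighted by group size.
    between_ss = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Sum of squares of individuals about their own group mean.
    within_ss = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)

    return between_ss / (k - 1), within_ss / (n_total - k)


# Hypothetical observations in three groups, invented for illustration only.
groups = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [6.0, 7.0, 8.0]]
between, within = variance_estimates(groups)
print(between, within, between / within)
```

A ratio of the two estimates far above unity, as in this invented example, is the kind of evidence that would point to the factor being operative.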

This method of analysis is based upon the assumptions that the residuals X,
(a) are normally distributed, and (b) have the same standard deviation, $\sigma$, whatever
be the values of the terms a, b, c, etc., that is to say, for all combinations of the
assignable causes. We may be justified in accepting this to be the true position in
very many practical cases, but it should be recognised that the method outlined above
does not put these assumptions to the test. There are indeed a number of problems
in which (b) is not true and where the discovery of significant differences, from
group to group, in the variation among the individuals, X, may lead to the identification
of further assignable causes of variation. Such has been found to be the
case, for example, in the analysis of variation in quality of articles in mass-production
industry. Further, we may be concerned not only with a single variable x, but with
a number of correlated variables x, y, z, ..., and we may then need to examine the
stability from group to group of the covariation as well as the variation among
residuals X, Y, Z, ....

(3) An Illustration of the Problem . The purpose of this paper is to develop 
certain methods recently suggested for dealing with these problems *. We shall 
treat here only the case of two correlated variables x and y, and shall suppose 
that the observations have been divided into k samples or groups. The problem 
will be to test whether these groups can be differentiated owing to significant 
differences either in the average values of x and y, or in the variation and covaria- 
tion of the residuals within the groups. The choice of suitable criteria, i.e. of the 
tests to be applied, has been based upon the use of the principle of likelihood as 
suggested by J. Neyman and E. S. Pearson. More recently + these writers have 
suggested a more fundamental method of determining the most efficient test of a 
statistical hypothesis; this method of choice has been shown in many cases to be 
identical with the method of likelihood, but in the particular problems we are now 
considering the correspondence has not yet been established. 

The following example, which is treated more fully below, will indicate the 
nature of the problem. In certain cases of manufacture tests of quality are destruc- 
tive. Such, for example, is a test for breaking strength; it is therefore important to 
find an alternative correlated measure which may be used in its place in routine 
testing. In dealing with metal products a measure of hardness, based on a test 
which is not destructive, is sometimes used as an index of tensile strength. If, 

* For the case of a single variable, see J. Neyman and E. S. Pearson: Bulletin de l'Académie Polonaise
des Sciences et des Lettres, Série A, 1930 and 1931. For the case of many variables, see S. S. Wilks:
Biometrika, Vol. xxiv (1932), pp. 471-494.

† Phil. Trans. of the Royal Soc., Series A, Vol. 231, pp. 289-337.



356 Analysis Appropriate for k Samples of Two Variables

however, we are to predict strength from hardness, using the correlation method, 
it is essential that the degree of relationship between the two qualities should 
remain stable. It must not change from one plant to another or from one month 
to the next; in other words a preliminary research investigation should not only 
be concerned with changes in average strength and hardness which can be attributed 
to assignable causes, but also with the stability of the covariation among the 
residuals X and Y. Table I shows a preliminary statistical analysis of 60 pairs of
test results made on a certain aluminium die-casting, divided into 5 groups of
12 pairs. Within a group the assignable causes of variation are believed to be
constant, but it is necessary to analyze not only the figures in the 2nd and 4th
columns, but also those in the 3rd, 5th and 6th.

TABLE I.

Data for Aluminium Die-Castings*. (Samples of 12 observations.)

            Tensile Strength              Hardness
            (10^3 lb. per sq. in.)        (Rockwell's E)
Sample                  Standard                    Standard      Coefficient of
No.         Mean        Deviation      Mean         Deviation     Correlation

1           33.399      2.565          68.49        10.19         0.683
2           28.216      4.318          68.02        14.49         0.876
3           30.313      2.188          66.57        10.17         0.714
4           33.150      3.954          76.12        11.08         0.715
5           34.269      2.715          69.92         9.88         0.805


II. Derivation of the Criteria. 

(4) We shall suppose that each of k samples, $\Sigma_1, \Sigma_2, \ldots, \Sigma_k$, of two variables x
and y has been drawn from some normal population. Let $\pi_t$ be the population from
which $\Sigma_t$ has been drawn and let the means of x and y in $\pi_t$ be $a_t$ and $b_t$, the standard
deviations $\sigma_{xt}$ and $\sigma_{yt}$ and the correlation coefficient $\rho_t$ (t = 1, 2, ... k). Thus, the
distribution law of $\pi_t$ will be

\[
\frac{1}{2\pi\sigma_{xt}\sigma_{yt}\sqrt{1-\rho_t^2}}\,
\exp\left\{-\frac{1}{2(1-\rho_t^2)}\left[\frac{(x-a_t)^2}{\sigma_{xt}^2}+\frac{(y-b_t)^2}{\sigma_{yt}^2}-\frac{2\rho_t(x-a_t)(y-b_t)}{\sigma_{xt}\sigma_{yt}}\right]\right\} \tag{2}
\]

Therefore, the probability of the joint occurrence of the samples $\Sigma_t$ from their
respective populations $\pi_t$ (t = 1, 2, ... k), with values of x and y falling in the intervals
$x_{t\alpha} \pm \tfrac12 dx_{t\alpha}$, $y_{t\alpha} \pm \tfrac12 dy_{t\alpha}$ ($\alpha$ = 1, 2, ... $n_t$; t = 1, 2, ... k) will be given, except for
infinitesimals of higher order than $dx_{t\alpha}$ and $dy_{t\alpha}$, by

\[
C = \prod_{t=1}^{k}\left(\frac{1}{2\pi\sigma_{xt}\sigma_{yt}\sqrt{1-\rho_t^2}}\right)^{n_t} e^{-\theta}\,dX\,dY \tag{3}
\]

* The data are taken from W. A. Shewhart's Economic Control of Manufactured Product, Macmillan
1931. Although they would hardly be adequate for a research investigation in practice, they are suggestive
and provide a good illustration of method.

Egon S. Pearson and S. S. Wilks 357

in which

\[
\theta = \sum_{t=1}^{k} n_t\left[\frac{s_{xt}^2+(\bar x_t-a_t)^2}{2\sigma_{xt}^2(1-\rho_t^2)}+\frac{s_{yt}^2+(\bar y_t-b_t)^2}{2\sigma_{yt}^2(1-\rho_t^2)}-\frac{2\rho_t\left[s_{xt}s_{yt}r_t+(\bar x_t-a_t)(\bar y_t-b_t)\right]}{2\sigma_{xt}\sigma_{yt}(1-\rho_t^2)}\right] \tag{4}
\]

where $\bar x_t$ and $\bar y_t$ are the means, $s_{xt}$ and $s_{yt}$ the standard deviations*, $r_t$ the correlation
coefficient of x and y, and $n_t$ the number of individuals in the sample $\Sigma_t$, and

\[
dX\,dY = \prod_{t=1}^{k}\prod_{\alpha=1}^{n_t} dx_{t\alpha}\,dy_{t\alpha} \tag{5}
\]

t — \ as»l 


We shall now consider the derivation of criteria for testing the following three
hypotheses concerning the populations $\pi_t$:

(i) The hypothesis H that the populations $\pi_t$ are identical, that is, that

\[ \sigma_{xt} = \sigma_x, \quad \sigma_{yt} = \sigma_y, \quad \rho_t = \rho \tag{6} \]

\[ a_t = a, \quad b_t = b \quad (t = 1, 2, \ldots k) \tag{7} \]

(ii) The hypothesis $H_1$ that the samples have come from populations with the
same set of variances and correlations but having means with any differing values
whatever, that is, that (6) is true whatever may be the values of the means $a_t$
and $b_t$.

(iii) The hypothesis $H_2$ that the samples are from populations in which (7) is
true, when it is assumed that (6) is true.

These are generalisations to two variables of the three hypotheses considered
by Neyman and Pearson† for the case of k samples of a single variable; or they
may be regarded as special cases of the more general problem whose solution has
been considered more recently by Wilks‡. Thus, it will only be necessary to indicate
briefly the steps involved in applying the method of likelihood to determine criteria
appropriate in testing H, $H_1$ and $H_2$. In each case we must fix:

(a) The class $\Omega$ of admissible sets of populations $\pi_t$ (t = 1, 2, ... k) from one set
of which the set of samples $\Sigma_t$ is assumed to have been drawn.

(b) The subclass $\omega$ of $\Omega$ to which the set $\pi_t$ must belong if the hypothesis tested
be true.

Then we must find the maximum of C in (3) for variations of the population
parameters under the assumption that the set $\pi_t$ is (i) a member of $\Omega$; call this
$C(\Omega \max)$; and (ii) a member of $\omega$; this we call $C(\omega \max)$. Then the expression
for the likelihood of the composite hypothesis H has been defined to be

\[ \lambda_H = \frac{C(\omega \max)}{C(\Omega \max)} \tag{8} \]

Let us consider this $\lambda$-criterion for each of the hypotheses H, $H_1$ and $H_2$.


* Here and throughout the paper the standard deviation in a sample of n will be defined by the
relation $s^2 = \frac{1}{n}\Sigma(x-\bar x)^2$.

† Bulletin de l'Académie Polonaise des Sciences et des Lettres, Série A, 1931.
‡ Loc. cit.





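(i) Criterion for H. We find that $C(\Omega \max)$ occurs when

\[ a_t = \bar x_t, \quad b_t = \bar y_t \tag{9} \]

\[ \sigma_{xt} = s_{xt}, \quad \sigma_{yt} = s_{yt}, \quad \rho_t = r_t \quad (t = 1, 2, \ldots k) \tag{10} \]

and $C(\omega \max)$ occurs when

\[ a = \bar x_0, \quad b = \bar y_0 \tag{11} \]

\[ \sigma_x^2 = v_{110} = v_{11a}+v_{11m}, \quad \sigma_y^2 = v_{220} = v_{22a}+v_{22m}, \quad \sigma_x\sigma_y\rho = v_{120} = v_{12a}+v_{12m} \tag{12} \]

where

\[ \bar x_0 = \frac{1}{N}\sum_{t=1}^{k} n_t\bar x_t, \quad \bar y_0 = \frac{1}{N}\sum_{t=1}^{k} n_t\bar y_t \tag{13} \]

\[
\begin{aligned}
N v_{110} &= \sum_{t=1}^{k}\sum_{\alpha=1}^{n_t}(x_{t\alpha}-\bar x_0)^2 = N s_{x0}^2,\\
N v_{220} &= \sum_{t=1}^{k}\sum_{\alpha=1}^{n_t}(y_{t\alpha}-\bar y_0)^2 = N s_{y0}^2,\\
N v_{120} &= \sum_{t=1}^{k}\sum_{\alpha=1}^{n_t}(x_{t\alpha}-\bar x_0)(y_{t\alpha}-\bar y_0) = N s_{x0}s_{y0}r_0
\end{aligned} \tag{14}
\]

that is to say $\bar x_0$, $\bar y_0$, $s_{x0}$, $s_{y0}$ and $r_0$ are the means, standard deviations and correlation
coefficient obtained on combining the N pairs of observations from the k samples.
Further

\[
\begin{aligned}
N v_{11a} &= \sum_{t=1}^{k}\sum_{\alpha=1}^{n_t}(x_{t\alpha}-\bar x_t)^2 = \sum_{t=1}^{k} n_t s_{xt}^2,\\
N v_{22a} &= \sum_{t=1}^{k}\sum_{\alpha=1}^{n_t}(y_{t\alpha}-\bar y_t)^2 = \sum_{t=1}^{k} n_t s_{yt}^2,\\
N v_{12a} &= \sum_{t=1}^{k}\sum_{\alpha=1}^{n_t}(x_{t\alpha}-\bar x_t)(y_{t\alpha}-\bar y_t) = \sum_{t=1}^{k} n_t s_{xt}s_{yt}r_t
\end{aligned} \tag{15}
\]

\[
N v_{11m} = \sum_{t=1}^{k} n_t(\bar x_t-\bar x_0)^2, \quad
N v_{22m} = \sum_{t=1}^{k} n_t(\bar y_t-\bar y_0)^2, \quad
N v_{12m} = \sum_{t=1}^{k} n_t(\bar x_t-\bar x_0)(\bar y_t-\bar y_0) \tag{16}
\]

We shall write for each sample (t = 1, 2, ... k)

\[
\begin{aligned}
n_t v_{11t} &= \sum_{\alpha=1}^{n_t}(x_{t\alpha}-\bar x_t)^2 = n_t s_{xt}^2, \quad
n_t v_{22t} = \sum_{\alpha=1}^{n_t}(y_{t\alpha}-\bar y_t)^2 = n_t s_{yt}^2,\\
n_t v_{12t} &= \sum_{\alpha=1}^{n_t}(x_{t\alpha}-\bar x_t)(y_{t\alpha}-\bar y_t) = n_t s_{xt}s_{yt}r_t
\end{aligned} \tag{17}
\]

Placing these values in (3) and taking the ratio as defined by (8) we find,

\[ \lambda_H = \prod_{t=1}^{k}\left(\frac{|v_{ijt}|}{|v_{ij0}|}\right)^{n_t/2} \tag{18} \]

where*

\[ |v_{ijt}| = \begin{vmatrix} v_{11t} & v_{12t}\\ v_{12t} & v_{22t}\end{vmatrix} = s_{xt}^2 s_{yt}^2(1-r_t^2) \tag{19} \]

\[ |v_{ij0}| = \begin{vmatrix} v_{110} & v_{120}\\ v_{120} & v_{220}\end{vmatrix} = s_{x0}^2 s_{y0}^2(1-r_0^2) \tag{20} \]

* For convenience we shall call $|v_{ijt}|$ the generalised variance of the t-th sample with elements having
$n_t - 1$ degrees of freedom. Similarly, $|v_{ij0}|$, $|v_{ijm}|$ and $|v_{ija}|$ will be generalised variances derived from
the combined samples, with elements having N - 1, k - 1 and N - k degrees of freedom respectively.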



(ii) Criterion for $H_1$. We find that $C(\Omega \max)$ occurs for the same values of the
parameters as in the case of H, namely, those given by (9) and (10). $C(\omega \max)$
occurs when (9) is true and when

\[ \sigma_x^2 = v_{11a}, \quad \sigma_y^2 = v_{22a}, \quad \sigma_x\sigma_y\rho = v_{12a} \tag{21} \]

Thus it follows that

\[ \lambda_{H_1} = \prod_{t=1}^{k}\left(\frac{|v_{ijt}|}{|v_{ija}|}\right)^{n_t/2} \tag{22} \]

where $|v_{ija}|$ is a determinant analogous to those of (19) and (20).

(iii) Criterion for $H_2$. $C(\Omega \max)$ occurs when (9) and (21) are satisfied and
$C(\omega \max)$ when the parameters have the values (11) and (12). Consequently

\[ \lambda_{H_2} = \left(\frac{|v_{ija}|}{|v_{ija}+v_{ijm}|}\right)^{N/2} = \left(\frac{|v_{ija}|}{|v_{ij0}|}\right)^{N/2} \tag{23} \]

As in the case of the single-variable problem we observe that

\[ \lambda_H = \lambda_{H_1} \times \lambda_{H_2} \tag{24} \]
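The three criteria can be evaluated directly from sample summaries. The following is a minimal sketch, not the authors' computing procedure: it takes the means, standard deviations (divisor n) and correlations of Table I, forms the second-order moments of (15) to (17), and returns the N-th roots of the criteria in (22), (23) and (24). The names `TABLE_I`, `det2` and `likelihood_criteria` are hypothetical helpers introduced only for this illustration.

```python
import math

# Table I summary statistics for each sample: (mean_x, sd_x, mean_y, sd_y, r).
# Standard deviations use the divisor n, as in the paper's footnote.
TABLE_I = [
    (33.399, 2.565, 68.49, 10.19, 0.683),
    (28.216, 4.318, 68.02, 14.49, 0.876),
    (30.313, 2.188, 66.57, 10.17, 0.714),
    (33.150, 3.954, 76.12, 11.08, 0.715),
    (34.269, 2.715, 69.92, 9.88, 0.805),
]


def det2(v11, v22, v12):
    """Determinant of a 2 x 2 matrix of variances and covariances."""
    return v11 * v22 - v12 * v12


def likelihood_criteria(samples, sizes):
    """Return the N-th roots of lambda_H1, lambda_H2 and lambda_H (hypothetical helper)."""
    N = sum(sizes)
    # Per-sample moments v_ijt, as in (17).
    v_t = [(sx * sx, sy * sy, sx * sy * r) for (_, sx, _, sy, r) in samples]
    # Pooled within-sample moments v_ija, as in (15).
    v_a = [sum(n * v[i] for n, v in zip(sizes, v_t)) / N for i in range(3)]
    # Grand means (13) and between-sample moments v_ijm, as in (16).
    x0 = sum(n * s[0] for n, s in zip(sizes, samples)) / N
    y0 = sum(n * s[2] for n, s in zip(sizes, samples)) / N
    v_m = [
        sum(n * (s[0] - x0) ** 2 for n, s in zip(sizes, samples)) / N,
        sum(n * (s[2] - y0) ** 2 for n, s in zip(sizes, samples)) / N,
        sum(n * (s[0] - x0) * (s[2] - y0) for n, s in zip(sizes, samples)) / N,
    ]
    v_0 = [a + m for a, m in zip(v_a, v_m)]  # total moments, as in (12)
    det_a, det_0 = det2(*v_a), det2(*v_0)
    # (22): lambda_H1 = prod (|v_ijt| / |v_ija|)^(n_t / 2); take the N-th root.
    log_root_h1 = sum(
        n / (2 * N) * math.log(det2(*v) / det_a) for n, v in zip(sizes, v_t)
    )
    root_h1 = math.exp(log_root_h1)
    # (23): lambda_H2 = (|v_ija| / |v_ij0|)^(N / 2); take the N-th root.
    root_h2 = math.sqrt(det_a / det_0)
    # (24): lambda_H = lambda_H1 * lambda_H2, so the roots multiply as well.
    return root_h1, root_h2, root_h1 * root_h2


ROOT_H1, ROOT_H2, ROOT_H = likelihood_criteria(TABLE_I, [12] * 5)
print(round(ROOT_H1, 3), round(ROOT_H2, 3), round(ROOT_H, 3))
```

Because Table I is rounded, the results should only be expected to lie close to the values .9065 and .6896 quoted in the worked example, which were computed from the unrounded sums of squares and products.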


III. Interpretation of the Criteria. 

(5) The $H_2$ test. We note that the structure of each of the $\lambda$'s given by (18),
(22) and (23) differs from that of the corresponding $\lambda$ of the single-variable
problem only in that determinants of the second order matrices of variances and
covariances appear in place of the corresponding sums of squares in the
single-variable case. In other words, determinants of the second order now take the place
of determinants of the first order.

We shall first examine $\lambda_{H_2}$; in testing $H_2$ we have assumed that (6) is true
and logically we should first consider the grounds for this assumption, if necessary
by testing $H_1$. The test of $H_2$ is, however, related to R. A. Fisher's tests in the
analysis of variance, and it will be clearer to consider this first.

A $\lambda$-criterion must lie between 0 and 1, and if the principle underlying the
selection is valid, as it decreases from unity towards zero we should be more and
more inclined to reject the hypothesis tested in favour of some one of the admissible
alternative hypotheses. How far do our intuitional requirements appear satisfied by
$\lambda_{H_2}$? The ratio of determinants

\[ \lambda_{H_2}^{2/N} = \frac{|v_{ija}|}{|v_{ija}+v_{ijm}|} \tag{25} \]

is of the form of the ratio i/r of Theorem I of the Appendix, if we set = Ay, and 
•Y = (26). 

It follows that

(a) $0 \le \lambda_{H_2} \le 1$; and when $|v_{ija}| > 0$;

(b) a necessary and sufficient condition for $\lambda_{H_2} = 1$ is that $\bar x_t = \bar x_0$, $\bar y_t = \bar y_0$
(t = 1, 2, ... k), that is, that all the sample means be the same;

(c) a necessary and sufficient condition that $\lambda_{H_2} = 0$ is that at least one of
the differences $\bar x_t - \bar x_0$, $\bar y_t - \bar y_0$ (t = 1, 2, ... k) be infinite.

It can also be shown by the ordinary methods of differentiation that $\lambda_{H_2}$ cannot
have any other maximum but that of unity (occurring when $\bar x_t = \bar x_0$ and $\bar y_t = \bar y_0$) for
any given values of the $v_{ija}$. In fact the maximum is the only stationary point.
Therefore, if we keep the intra-sample variation the same, and allow the system to
vary from one in which all of the sample means are equal to the other extreme in
which at least one sample mean differs very greatly from the mean of the whole,
$\lambda_{H_2}$ will at the same time decrease from 1 to 0.

The case where

\[ |v_{ija}| = 0 \tag{27} \]

needs special consideration. The determinant is essentially of the form

\[
\begin{vmatrix} \Sigma\,\xi_{t\alpha}^2 & \Sigma\,\xi_{t\alpha}\eta_{t\alpha}\\ \Sigma\,\xi_{t\alpha}\eta_{t\alpha} & \Sigma\,\eta_{t\alpha}^2 \end{vmatrix},
\quad\text{where}\quad
\Sigma = \sum_{t=1}^{k}\sum_{\alpha=1}^{n_t}, \quad
\xi_{t\alpha} = \frac{x_{t\alpha}-\bar x_t}{\sqrt{N}} \quad\text{and}\quad
\eta_{t\alpha} = \frac{y_{t\alpha}-\bar y_t}{\sqrt{N}} \tag{28}
\]

which can vanish only when $\xi_{t\alpha} = c\,\eta_{t\alpha}$, i.e. when $x_{t\alpha} - \bar x_t = c\,(y_{t\alpha} - \bar y_t)$ where c is a
constant for all t and $\alpha$, which may be finite or infinite. In this case the observation
points $(x_{t\alpha}, y_{t\alpha})$ for each sample lie on a straight line, and the lines are all parallel.
If c = 0 the lines will be horizontal as in Fig. 1 (a), if c = ±∞ they will be vertical
as in (b), otherwise they will be sloping as in (c)*. Hypothesis $H_2$ is exceedingly
improbable (and $\lambda_{H_2} = 0$) unless these lines coincide, which will occur only when

\[ x_{t\alpha} - \bar x_0 = c\,(y_{t\alpha} - \bar y_0), \]

that is when $\bar x_t - \bar x_0 = c\,(\bar y_t - \bar y_0)$. In this case $\lambda_{H_2} = 0/0$, and we are really reduced
to a single-variable problem, and could apply the appropriate $H_2$ test for that case.
It appears therefore that the criterion $\lambda_{H_2}$ does satisfy our intuitional requirements,
at any rate as far as the limiting values 1 and 0 are concerned.


(6) Alternatives to $\lambda_{H_2}$. In the one-variable problem $\lambda_{H_2}$ is expressible in terms
of $\eta^2$, the squared correlation ratio, and hence also in terms of $1 - \eta^2$†. In fact, using
the present notation, and considering the x variable only,

\[ \lambda_{H_2}^{2/N} = \frac{v_{11a}}{v_{11a}+v_{11m}} = 1 - \frac{v_{11m}}{v_{11a}+v_{11m}} = 1 - \eta^2 \tag{29} \]

and consequently it is immaterial which of the two ratios is regarded as the
criterion. In the case of two variables the corresponding ratios will be

\[ L_2 = \lambda_{H_2}^{1/N} = \sqrt{\frac{|v_{ija}|}{|v_{ija}+v_{ijm}|}}, \qquad U_2 = \sqrt{\frac{|v_{ijm}|}{|v_{ija}+v_{ijm}|}} \tag{30} \]

but here $U_2$ cannot be expressed as a single valued function of $L_2$. As will
be shown below, the sampling distributions of both $L_2$ and $U_2$ are very simple
functions, and it is natural to ask whether $U_2$ might be used as an alternative criterion

* In the diagram the spots represent observation points (x, y) and the circles represent means of
samples.
t AetuaUy 





for testing hypothesis $H_2$, decreasing from 1 to 0 as the hypothesis becomes more
and more likely. It can be readily seen however that $U_2$ is not a suitable criterion.
Suppose that the $v_{ija}$ are finite and not zero, so that there is variation within the
samples; then $U_2 \to 0$ when $|v_{ijm}| \to 0$. This may occur,

(a) When $\bar x_t \to \bar x_0$ and $\bar y_t \to \bar y_0$ for t = 1, 2, ... k, i.e. when the means of all
samples tend to coincide, and hypothesis $H_2$ is probable.

(b) But since $|v_{ijm}|$ is of the form (28) it follows that it will also tend to zero
when $\bar x_t - \bar x_0 \to c\,(\bar y_t - \bar y_0)$, c being the same constant for all t. This would happen
when the sample means tend to lie on a straight line, and when, as suggested in
Fig. 1 (d), hypothesis $H_2$ may be quite untenable. Clearly therefore $U_2$ is not an
acceptable criterion.
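The failure described in (b) is easy to exhibit numerically. The sketch below uses invented sample means (they are not data from the paper) and a hypothetical helper `between_sample_determinant` that forms the between-sample generalised variance from weighted means as in (16). When the means lie exactly on a straight line the determinant vanishes although the means differ widely.

```python
def between_sample_determinant(means, sizes):
    """|v_ijm| built from the sample means (x_t, y_t) with weights n_t, as in (16)."""
    N = sum(sizes)
    x0 = sum(n * x for n, (x, _) in zip(sizes, means)) / N
    y0 = sum(n * y for n, (_, y) in zip(sizes, means)) / N
    v11 = sum(n * (x - x0) ** 2 for n, (x, _) in zip(sizes, means)) / N
    v22 = sum(n * (y - y0) ** 2 for n, (_, y) in zip(sizes, means)) / N
    v12 = sum(n * (x - x0) * (y - y0) for n, (x, y) in zip(sizes, means)) / N
    return v11 * v22 - v12 * v12


# Invented illustrative means: widely separated, but exactly on the line y = 2x.
COLLINEAR = [(0.0, 0.0), (10.0, 20.0), (20.0, 40.0), (30.0, 60.0)]
# The same x values with the y means moved off the line.
SCATTERED = [(0.0, 0.0), (10.0, 25.0), (20.0, 35.0), (30.0, 62.0)]
SIZES = [12, 12, 12, 12]

print(between_sample_determinant(COLLINEAR, SIZES))  # vanishes: means are collinear
print(between_sample_determinant(SCATTERED, SIZES))  # positive: means are not collinear
```

A criterion proportional to this determinant would therefore treat the collinear case like the case of coincident means, which is the objection raised in the text.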

There are two other forms of alternative criteria which it is of interest to refer 
to here. On the assumption that the samples have been drawn from identical 
normal populations, it is possible to obtain from the data two independent estimates 

of both,

(a) $\sigma_x\sigma_y\sqrt{1-\rho^2}$,

(b) $\sigma_x\sigma_y\rho$.




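Case (a). If we write $\theta = |v_{ija}|$, $\phi = |v_{ijm}|$, then it is known* that

\[ df(\theta) = \frac{2^{N-k-2}A^{\frac12(N-k-1)}}{\Gamma(N-k-1)}\,\theta^{\frac12(N-k-3)}\,e^{-2\sqrt{A\theta}}\,d\theta \tag{31} \]

\[ df(\phi) = \frac{2^{k-3}A^{\frac12(k-2)}}{\Gamma(k-2)}\,\phi^{\frac12(k-4)}\,e^{-2\sqrt{A\phi}}\,d\phi \tag{32} \]

where $A = N^2/[4\sigma_x^2\sigma_y^2(1-\rho^2)]$. Furthermore $\theta$ and $\phi$ are independently distributed,
and it may be readily shown from (31) and (32), using the symbol E for "expected"
values, that

\[ E(\sqrt{\theta}) = \frac{N-k-1}{N}\,\sigma_x\sigma_y\sqrt{1-\rho^2}, \qquad E(\sqrt{\phi}) = \frac{k-2}{N}\,\sigma_x\sigma_y\sqrt{1-\rho^2} \tag{33} \]

Hence $\frac{N}{N-k-1}\sqrt{|v_{ija}|}$ and $\frac{N}{k-2}\sqrt{|v_{ijm}|}$ may be taken as independent estimates
of $\sigma_x\sigma_y\sqrt{1-\rho^2}$ with elements having N - k and k - 1 degrees of freedom respectively.
If we now take the ratio of the two estimates, or

\[ \psi = \frac{N-k-1}{k-2}\sqrt{\frac{|v_{ijm}|}{|v_{ija}|}} \tag{34} \]

it is found from (31) and (32) that

\[ df(\psi) = \frac{\Gamma(N-3)}{\Gamma(k-2)\,\Gamma(N-k-1)}\,\frac{(k-2)^{k-2}(N-k-1)^{N-k-1}\,\psi^{k-3}}{[(k-2)\psi+(N-k-1)]^{N-3}}\,d\psi \tag{35} \]

This is a Pearson Type VI curve; the 5% and 1% sampling limits for $\psi$ could
be obtained by taking

\[ z = \tfrac12\log_e\psi \tag{36} \]

and entering R. A. Fisher's† tables of z with 2 (k - 2) and 2 (N - k - 1) degrees of
freedom.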

The criterion $\psi$ will not, however, be suitable for testing the hypothesis $H_2$.
We may write

\[ |v_{ijm}| = s_{xm}^2 s_{ym}^2(1-r_m^2) \tag{37} \]

where $s_{xm}$, $s_{ym}$ and $r_m$ are the standard deviations and coefficient of correlation of
the weighted sample means. Then clearly, it would be possible for $\psi$ to be unity
and $|v_{ija}|$ fixed and finite, while $r_m \to 1$ and either $s_{xm}$ or $s_{ym} \to \infty$. In such a
situation $H_2$ would be untenable and yet the two independent estimates of
$\sigma_x\sigma_y\sqrt{1-\rho^2}$ would be equal.

Case (b). It can be easily shown that $N v_{12a}/(N-k)$ and $N v_{12m}/(k-1)$ are two independent
estimates of $\sigma_x\sigma_y\rho$, but their ratio, say $\psi'$, would again be unsuitable for testing $H_2$
for reasons similar to those holding in the case of $\psi$. It should also be pointed
out that the sampling distribution of $\psi'$ is extremely complicated.

* S. S. Wilks: loc. cit. p. 477.

† R. A. Fisher: Statistical Methods for Research Workers, 4th edition 1932. Edinburgh: Oliver
and Boyd.





These illustrations bring out forcibly an important but often neglected 
consideration. A critical examination of the efficiency of any statistical criterion is 
necessary before it is applied to testing a hypothesis. The fact that its sampling 
distribution is known if the hypothesis be true, does not by itself justify its use. 
In the present case in using $\psi$ or $\psi'$ we should be in danger of accepting the
hypothesis $H_2$ in certain cases when it is evidently not true. It is only the
likelihood criterion, $\lambda_{H_2}$, which appears suitable for our purpose.

(7) The $H_1$ test. $\lambda_{H_1}$ has been defined by (22); clearly $\lambda_{H_1}^{2/N}$ is of the form of the
ratio $\theta$ discussed in Theorem II of the Appendix, and it satisfies all of the conditions
of that theorem. It follows that $\lambda_{H_1}^{2/N}$ and consequently $\lambda_{H_1}$

(a) must lie between 0 and 1;

(b) will be unity when and only when $v_{ijt} = v_{iju}$ (i, j = 1, 2) for all values of
t and u, that is, when the variances and covariances of x and y are respectively
equal in all the k samples;

(c) will be zero when, (i) $x_{t\alpha} - \bar x_t = c_t\,(y_{t\alpha} - \bar y_t)$ for at least one value of t,
where $c_t$ is a constant for all values of $\alpha$; this means that the sample points in at
least one sample will lie on a straight line, and there is perfect correlation in
some but not all samples. However, if there be a c such that $x_{t\alpha} - \bar x_t = c\,(y_{t\alpha} - \bar y_t)$
for all values of $\alpha$ and t, $\lambda_{H_1}$ has the indeterminate form 0/0, and the points of
each of the samples will lie on a straight line and the lines will all be parallel. In
this case the problem is reduced to that of a single variable, and the appropriate $H_1$
test could be applied; (ii) one of the deviations $x_{t\alpha} - \bar x_t$ or $y_{t\alpha} - \bar y_t$ or both are
infinite for at least one value of $\alpha$ and t (but not all values of t, assuming $|v_{ijt}| > 0$
for all t), subject to the condition that the limiting values of the generalised
variances remain finite and not zero as these deviations become infinite. Under
these conditions it follows by the argument of the proof of (c) in Theorem I of
the Appendix that $|v_{ija}|$ becomes infinite while the $|v_{ijt}|$ remain finite and different
from zero. The situation is a limiting form of that in which the variation is very
much greater in some samples than in others.

(8) The H test. Since $\lambda_H$ is the product of $\lambda_{H_1}$ and $\lambda_{H_2}$ it must lie between
0 and 1. It can be unity only when both $\lambda_{H_1}$ and $\lambda_{H_2}$ are unity, that is to say,
when the means, variances and covariances of x and y in all k samples are
respectively identical. It will approach zero when $\lambda_{H_1}$ or $\lambda_{H_2}$ or both approach zero.

As in the single-variable problem, the three $\lambda$ ratios appear therefore to satisfy
our intuitive requirements as criteria for testing H, $H_1$ and $H_2$, for they are
quantities which tend to unity as the corresponding hypothesis becomes intuitively 
more probable (as far as the information contained in the sample is concerned) and 
tend to zero as it becomes more likely that the hypothesis is false. Whether the 
tests based on these criteria satisfy the more fundamental conditions laid down by 
Neyman and Pearson*, we do not yet know. The problem of testing these hypotheses 

* Phil. Trans. Roy. Soc., Ser. A, Vol. 231 (1933).


will be completed by determining the sampling distributions of the X’s on the 
assumption that the corresponding hypotheses are true, for without these we have 
no means of testing the significance of an observed value of X. In the following 
section we shall first give expressions for the moment coefficients, and then by in- 
verting the moment equations show how the frequency distributions may be obtained. 
The result is simple only in the case of $\lambda_{H_2}$; for $\lambda_{H_1}$ and $\lambda_H$ numerical values for the
probability integrals can be obtained only by some method of approximation.


IV. The Moment Coefficients and Distributions of the Criteria. 

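(9) The Moment Coefficients. In the single-variable problem it was found to be
convenient to study the sampling distributions of some fractional power of the $\lambda$'s,
rather than that of the $\lambda$'s themselves, owing to the extreme skewness of the latter
distributions*. The use of $\lambda_{H_2}^{2/N}$ was suggested largely because in this case

\[ \lambda_{H_2}^{2/N} = 1 - \eta^2 \]

(where $\eta^2$ is the squared correlation ratio) and had a sampling distribution of Type I
form. In the present bivariate case we shall find for similar reasons, which will
become apparent as we proceed, that some advantage will be gained by using the
$\frac{1}{N}$-th power of the $\lambda$'s. The moments of the $\lambda$'s have been given by one of the
writers† in a recent paper for the case of an n-variate normal system. If we denote
by $M_h$, $M_{1h}$ and $M_{2h}$ the h-th moment coefficients about zero of $\lambda_H^{1/N}$, $\lambda_{H_1}^{1/N}$ and $\lambda_{H_2}^{1/N}$
respectively, when the corresponding hypotheses H, $H_1$ and $H_2$ are true, then we
have at once from the paper just cited, for the case of two variables (i.e. n = 2),

\[
M_{2h} = \frac{\Gamma\left(\frac{N-k+h}{2}\right)\Gamma\left(\frac{N-k-1+h}{2}\right)\Gamma\left(\frac{N-1}{2}\right)\Gamma\left(\frac{N-2}{2}\right)}{\Gamma\left(\frac{N-k}{2}\right)\Gamma\left(\frac{N-k-1}{2}\right)\Gamma\left(\frac{N-1+h}{2}\right)\Gamma\left(\frac{N-2+h}{2}\right)} \tag{38}
\]

\[
M_{1h} = \frac{\Gamma\left(\frac{N-k}{2}\right)\Gamma\left(\frac{N-k-1}{2}\right)}{\Gamma\left(\frac{N-k+h}{2}\right)\Gamma\left(\frac{N-k-1+h}{2}\right)}\prod_{t=1}^{k}\left(\frac{N}{n_t}\right)^{hn_t/N}\frac{\Gamma\left(\frac{n_t-1}{2}+\frac{hn_t}{2N}\right)\Gamma\left(\frac{n_t-2}{2}+\frac{hn_t}{2N}\right)}{\Gamma\left(\frac{n_t-1}{2}\right)\Gamma\left(\frac{n_t-2}{2}\right)} \tag{39}
\]

\[ M_h = M_{1h} \times M_{2h} \tag{40} \]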


* Cf. J. Neyman and E. S. Pearson: Bulletin de l'Académie Polonaise des Sciences et des Lettres,
Série A (1931), pp. 476-476.

f S. S. Wilks: Biometrika , Vol. xxiv (1982), pp. 471- 494. In this paper the generalisations of Air* 
and Ajj, were denoted by Nar<m)» A and X# respectively. 





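These expressions may be considerably simplified by making use of the
duplication formula of the Gamma function which can be written

\[ \Gamma(x+\tfrac12)\,\Gamma(x+1) = \frac{\sqrt{\pi}\,\Gamma(2x+1)}{2^{2x}} \tag{41} \]

Applying this to (38) and (39) we get

\[ M_{2h} = \frac{\Gamma(N-2)\,\Gamma(N-k-1+h)}{\Gamma(N-k-1)\,\Gamma(N-2+h)} \tag{42} \]

and

\[ M_{1h} = \frac{\Gamma(N-k-1)}{\Gamma(N-k-1+h)}\prod_{t=1}^{k}\left(\frac{N}{n_t}\right)^{hn_t/N}\frac{\Gamma\left(n_t-2+\frac{hn_t}{N}\right)}{\Gamma(n_t-2)} \tag{43} \]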

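(10) The Distributions of $L_2$ and $U_2$. To find the distribution of $L_2 = \lambda_{H_2}^{1/N}$ we
use the relation

\[ \frac{\Gamma(N-k-1+h)}{\Gamma(N-2+h)} = \frac{1}{\Gamma(k-1)}\int_0^1 u^{N-k-2+h}(1-u)^{k-2}\,du \tag{44} \]

in (42). Accordingly, we find that the h-th moment of $L_2$ is identical with that of
u, where the distribution of u is

\[ \frac{\Gamma(N-2)}{\Gamma(N-k-1)\,\Gamma(k-1)}\,u^{N-k-2}(1-u)^{k-2}\,du \tag{45} \]

Therefore it follows from the uniqueness of the solution of the moment problem
for a finite interval* that the distribution of $L_2$ must be identical with that of u,
and is given by

\[ df(L_2) = \frac{\Gamma(N-2)}{\Gamma(N-k-1)\,\Gamma(k-1)}\,L_2^{N-k-2}(1-L_2)^{k-2}\,dL_2 \tag{46} \]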

In a similar manner, it follows that the distribution of $U_2$ is given by

\[ df(U_2) = \frac{\Gamma(N-2)}{\Gamma(k-2)\,\Gamma(N-k)}\,U_2^{k-3}(1-U_2)^{N-k-1}\,dU_2 \tag{47} \]


In both these cases the probability integral is an incomplete B-Function. 
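As a numerical check on the reduction from (38) to (42), the sketch below evaluates both forms through log-gamma functions and compares them with the moments of a Beta law with parameters N - k - 1 and k - 1, which is what (46) asserts. The function names are hypothetical, and the four-gamma form coded here is the one written out above for two variables; the values N = 60, k = 5 are those of the worked example.

```python
import math


def m2h_from_38(N, k, h):
    """h-th moment of L2 in the four-gamma form (38)."""
    lg = math.lgamma
    num = lg((N - k + h) / 2) + lg((N - k - 1 + h) / 2) + lg((N - 1) / 2) + lg((N - 2) / 2)
    den = lg((N - k) / 2) + lg((N - k - 1) / 2) + lg((N - 1 + h) / 2) + lg((N - 2 + h) / 2)
    return math.exp(num - den)


def m2h_from_42(N, k, h):
    """h-th moment of L2 after applying the duplication formula, as in (42)."""
    lg = math.lgamma
    return math.exp(lg(N - 2) + lg(N - k - 1 + h) - lg(N - k - 1) - lg(N - 2 + h))


N, k = 60, 5
for h in (1, 2, 3):
    print(h, m2h_from_38(N, k, h), m2h_from_42(N, k, h))

# For h = 1 the Beta(N - k - 1, k - 1) mean is (N - k - 1) / (N - 2).
print((N - k - 1) / (N - 2))
```

Agreement of the two columns for several values of h is consistent with, though of course not a proof of, the identification of the distribution of $L_2$ with (46).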

(11) The Distribution of $L_1$. Let us consider the sampling distribution of
$\lambda_{H_1}^{1/N} = L_1$, say. The h-th moment of $L_1$ about zero is given by (43). If we multiply and
divide (43) by $\Gamma(N-2k+h)$ and then use the following relations,

\[ \frac{\Gamma(N-2k+h)}{\Gamma(N-k-1+h)} = \frac{1}{\Gamma(k-1)}\int_0^1 u^{N-2k-1+h}(1-u)^{k-2}\,du \tag{48} \]

and

\[ \frac{\prod_{t=1}^{k}\Gamma(n_t-2+hp_t)}{\Gamma(N-2k+h)} = \int_0^1\!\int_0^{J_1}\!\cdots\!\int_0^{J_{k-2}}(1-v_1-v_2-\ldots-v_{k-1})^{n_k-3+hp_k}\prod_{t=1}^{k-1}\left(v_t^{\,n_t-3+hp_t}\,dv_t\right) \tag{49} \]

* See W. Stekloff: Mémoires de l'Académie Impériale des Sciences de St Pétersbourg, Vol. xxxiii, No. 9
(1915).

where $J_i = 1 - v_1 - v_2 - \ldots - v_i$ (i = 1, 2, ... k - 2) and $p_t = n_t/N$ (t = 1, 2, ... k), we find
that the h-th moment (h = 0, 1, 2, ...) of $L_1$ is identical with that of

\[ \phi = \frac{1}{p_1^{p_1}p_2^{p_2}\cdots p_k^{p_k}}\;u\,v_1^{p_1}v_2^{p_2}\cdots v_{k-1}^{p_{k-1}}(1-v_1-v_2-\ldots-v_{k-1})^{p_k} \tag{50} \]

where u and the v's are distributed according to the function

\[ C\,u^{N-2k-1}(1-u)^{k-2}(1-v_1-v_2-\ldots-v_{k-1})^{n_k-3}\prod_{t=1}^{k-1}v_t^{\,n_t-3} \tag{51} \]

where $0 \le u \le 1$, $v_t \ge 0$, and $v_1 + v_2 + \ldots + v_{k-1} \le 1$ and C is a constant depending only
on k and the n's. Therefore, it follows from the argument used in establishing the
uniqueness of (46) that the distribution of $L_1$ is identical with that of $\phi$*. The
problem of finding the distribution of $\phi$ is equivalent to that of solving (50) for
the u or one of the v's and substituting in (51) and integrating with respect to all
variables except $\phi$. This process is extremely complicated, even when the p's are
all equal, that is, when $n_1 = n_2 = \ldots = n_k = n$; in this case we can find an expression
for the distribution of the $L_1$ by considering a transformation of $M_{1h}$. The new form
of $M_{1h}$ is found by applying the transformation†

\[ \Gamma(mz) = (2\pi)^{\frac12(1-m)}\,m^{mz-\frac12}\,\Gamma(z)\,\Gamma\left(z+\frac1m\right)\cdots\Gamma\left(z+\frac{m-1}{m}\right) \tag{52} \]

to $\Gamma(N-k-1)$ and $\Gamma(N-k-1+h)$ in (43), by writing m = k and z = n - 1 - 1/k
in the first and m = k and z = n - 1 - 1/k + h/k in the second. Accordingly, we get

\[ M_{1h} = C\,\frac{\Gamma^k\left(n-2+\frac hk\right)}{\Gamma\left(n-1-\frac1k+\frac hk\right)\Gamma\left(n-1+\frac hk\right)\Gamma\left(n-1+\frac1k+\frac hk\right)\cdots\Gamma\left(n-1+\frac{k-2}{k}+\frac hk\right)} \tag{53} \]

where

\[ C = \frac{\Gamma\left(n-1-\frac1k\right)\Gamma(n-1)\,\Gamma\left(n-1+\frac1k\right)\cdots\Gamma\left(n-1+\frac{k-2}{k}\right)}{\Gamma^k(n-2)} \]


Distribution functions with moments of this type have been considered by one
of the authors‡, from which we can write at once as the distribution of $L_1$


OJ (* + l)(fc + 2)-2 , 

d/(I x )=C'i 1 w -"(l - Lt)* » l d{L{) 


fSA 


k-l k 2k - 8 

0 t k ^ 1 0 t k' m ...0 k -! k 


* In this connection we note that a simple alternative proof of (a) in Theorem II of the Appendix can
be constructed at once for the case where the p's are rational numbers and the $a_{ij}$ are product moments,
by showing that the maximum of $\phi$ is unity for variations of u and the v's in the region over which (51)
is defined. Indeed, for a given value of u we find that the only stationary point with respect to the v's
is the true maximum which occurs where $v_t = p_t$ (t = 1, 2, ... k - 1). Therefore, $\phi$, which is necessarily
positive, has a maximum of unity, and since the range of $\phi$ and $L_1$ must be the same we have
$0 \le L_1 \le 1$, that is, $0 \le \theta \le 1$.

† See Whittaker and Watson: Modern Analysis (4th edition), p. 240.
‡ Wilks, loc. cit. pp. 474-475.





x (1 >1 - - 1 (1 - d,) 8 1‘- » 


2 (*-3)-<*zi!Ar.?-i 


8 _»Jb»_ i 

...(!'- «(w) a* 


* *+i 

X [1 - 0i(l - Iflrni - [*i + <9.(1 - * 1 )} (1 - la*))— ... 
x [1 — {^x + 8t (1 — 8i) + ... + 6/,- 1(1 — 0i) ... 

2*- a 

... (i - *»-*)} (i - W)]~ r de 1 ... de k . x (54), 

where (using formula (52)) 

„ C T(N-k-l) 

r ^-2^"r*(n-2)r(A ; -l)^-«*’ 

a slightly more condensed form of (54) can be obtained by setting 

0 t = l (t = l,2,...£-l). 

(t+l) (*+*)-> 

Thus d/(Z!) = (1 - L^r « " 1 d (V) 

xff , ^-1-. 

JoJo h 

k 1 i fc x 

*[(i-<M r ~ (W«) r (1-<M * " ...(1-^t-i * "] 

- 1 -- 

x [i - (i - mi - v)]- 1 [i -(i-<m,xi - m *... 

_ 2+ ? 

. . . [1 - (1 - hfr . .. fc_l) (1 - I,*)] * d&dfc . . . dfc-t 

(55). 

The distribution of L\ for two samples ( k = 2) turns out to be 

d/(£i) - p il ‘" _8 l0g ( 1 + Zl *) dZl (56) ’ 

and the significance of an observed value L x can be obtained from the probability 
integral (57), which results from integrating (56) by parts. 


P(ia<M»fV(^) 

J o 


r(n-l)I> 




This expression depends on the Incomplete Beta Function. When k > 2 we have
so far been unable to find any simple expression for $df(L_1)$, and some method of
approximation seems necessary. Approximate methods are discussed below.

(12) The Distribution of L. In a similar manner, if we let $L = \lambda_H^{1/N}$, then the
h-th moment of L when $n_1 = n_2 = \ldots = n_k = n$ is

\[ M_h = \frac{k^h\,\Gamma(nk-2)}{\Gamma(nk-2+h)}\cdot\frac{\Gamma^k\left(n-2+\frac hk\right)}{\Gamma^k(n-2)} \tag{58} \]



and the distribution of L can be expressed as

d A L ) = pf(^2)r (2^2)^** ■ L<n " 3,i (1 ~ Lk) 


2k 


,(k-a)(k-a)-a , 


a* 


d(£*) 


J 'JA* 






0a - '" * 

X (iV^W' rl -d— ^_ 1 ) a+ *; 4 " 1 [i— a— fcxi— /.*)]" 8+i 

-8-L-J 

; , . . . dipt - x (59). 

For the case k = 2, (59) becomes 

d/d) - I ,n - 5 {log ( l + V ] Tr£i ) _ Vl~^Z*J «*£ ...(60). 
The probability integral of (60) assumes the form 

PlL < *>-/>« - ror- T/r^g- { <,,, ~‘[ log (-4^) 

- (1 - P) 1 ] + i /V~* (1 - »)* %} • • -(61). 

Again, if k > 2 some approximate solution appears necessary.

We note from the distributions of $L_1$ and L that (55) and (59) are actually the
distributions of the n-th roots of $\lambda_{H_1}$ and $\lambda_H$ respectively.


(13) Approximate Solution for testing $H_1$. When k > 2 it appears necessary to
employ some approximate method to calculate the probability integral of the
sampling distribution of $\lambda_{H_1}$ or of $L_1 = \lambda_{H_1}^{1/N}$. To establish the best and simplest
method of procedure fuller investigation is required, but we believe that the
relatively simple form of approximation which has been used in the single variable
case is also suitable here. This involves the assumption that the sampling
distribution of $L_1$ may be represented by the law

\[ f(L_1) = \frac{\Gamma(m_1+m_2)}{\Gamma(m_1)\,\Gamma(m_2)}\,L_1^{m_1-1}(1-L_1)^{m_2-1} \tag{62} \]

where $m_1$ and $m_2$ are determined so that the first and second moment coefficients
of $f(L_1)$ about $L_1 = 0$ have the values given by (43) for h = 1 and 2 respectively.
In other words we represent the distribution of $L_1$ by a Type I curve having the
correct terminals and first two moment coefficients. In many practical applications
it is possible to plan for the number of individuals in each sample to be the same,
i.e. for $n_t = n$ (t = 1, 2, ... k). This is the situation considered in the illustrations
which follow. The equations for determining $m_1$ and $m_2$ then become

\[ m_1 = \frac{M_{11}(M_{11}-M_{12})}{M_{12}-M_{11}^2}, \qquad m_2 = \frac{(1-M_{11})(M_{11}-M_{12})}{M_{12}-M_{11}^2} \tag{63} \]

in which, since N = nk, we obtain from (43)

\[ M_{11} = \frac{k}{N-k-1}\left[\frac{\Gamma\left(n-2+\frac1k\right)}{\Gamma(n-2)}\right]^k \tag{64} \]

\[ M_{12} = \frac{k^2}{(N-k-1)(N-k)}\left[\frac{\Gamma\left(n-2+\frac2k\right)}{\Gamma(n-2)}\right]^k \tag{65} \]

For the probability integral of (62), we have

\[ P(L_1 \le l_1) = I_{l_1}(m_1, m_2) \tag{66} \]

which is the Incomplete Beta Function. This may be obtained from the Tables of
the Incomplete Beta Function* if $m_1$ and $m_2$ are < 50 or by means of R. A. Fisher's
z-transformation as has been suggested elsewhere†. If $m_1$ and $m_2$ are both large or
nearly equal, (62) will approach the normal form and the ratio

\[ \frac{L_1 - \text{Mean } L_1}{\sigma_{L_1}} = \frac{l_1 - M_{11}}{\sqrt{M_{12}-M_{11}^2}} \tag{67} \]

can be used as an index of significance to be interpreted on the normal probability
scale.

The probability integral of $L = \lambda_H^{1/N}$ could be obtained by a similar approximation,
but in general it is likely that the hypotheses $H_1$ and $H_2$ will be tested separately.

V. Practical Illustrations. 

(14) Example 1. Relation between Tensile Strength and Hardness in Aluminium 
Die-Castings. 

This example has been referred to above in Section (3). We shall proceed first
to test $H_1$, that is, the hypothesis that there is no significant difference between
the samples as regards variation and covariation in strength and hardness. A
summary of the necessary calculations is shown in Table II; we have N = 60, k = 5,
n = 12. The unit for x (strength) is 1000 lb. per square inch, and for y it is
Rockwell's E.

From (22) we have

\[ L_1 = \lambda_{H_1}^{1/N} = \frac{\prod_{t=1}^{k}|v_{ijt}|^{1/(2k)}}{|v_{ija}|^{1/2}} \tag{68} \]

and as indicated in the table it is found that $L_1 = .9065$. From (64) and (65) it is
found that if $H_1$ were true‡, then Mean $L_1 = M_{11} = .889274$, $M_{12} = .792592$,

* To be issued shortly as a Biometrika publication.

† Biometrika, Vol. xxiv. p. 415.

‡ Brownlee's Seven-Figure Tables of the Logarithm of the Gamma Function were used, Tracts for
Computers, No. ix.




TABLE II.

Strength (x) and Hardness (y) in Aluminium Die-Castings. Test of $H_1$ (bivariate).

               Sums of Squares                       Sums of Products          Generalised Variances
t              n v_11t             n v_22t           n v_12t
(Sample No.)   = Σ(x_tα - x̄_t)²    = Σ(y_tα - ȳ_t)²   = Σ(x_tα - x̄_t)(y_tα - ȳ_t)   |v_ijt|      log |v_ijt|

1               78.948             1247.18            214.18                    365.204      2.56254
2              223.695             2519.31            657.62                    910.401      2.95923
3               57.448             1241.78            190.63                    243.029      2.38566
4              187.618             1473.44            375.91                    938.451      2.97241
5               88.456             1171.73            259.18                    253.281      2.40360

Totals         636.165             7653.44           1697.52                                13.28344
               = N v_11a           = N v_22a          = N v_12a                              = Σ log(|v_ijt|)

$|v_{ija}| = v_{11a}v_{22a} - v_{12a}^2 = 552.018$,

$\log L_1 = \tfrac12\left\{\frac1k\,\Sigma\log(|v_{ijt}|) - \log(|v_{ija}|)\right\}$ (from definition (68))
$= \bar{1}.957367$,

$L_1 = .9065$.

Estimate of correlation within samples: $r_a = \dfrac{N v_{12a}}{\sqrt{N v_{11a}\times N v_{22a}}} = +.7693$.


$\sigma_{L_1} = .04223$. The observed value of $L_1$ is therefore nearer to unity than the mean
value expected in repeated samples, and the ratio (67) is only +0.41. Therefore
there is clearly no reason for rejecting $H_1$. If, however, we were to proceed in more
detail we should find from (63) that $m_1 = 48.210$, and $m_2 = 6.003$, and by interpolating
in the Tables of the Incomplete Beta Function that $P(L_1 \le .9065) = .621$.


We may now proceed to test $H_2$, the hypothesis that neither mean strength nor
mean hardness differs significantly from sample to sample. Table III contains a
summary of the calculations in the form of an analysis of variance table. It is seen
that $L_2 = .6896$, while if $H_2$ were true, the probability law for $L_2$ is obtained from
(46) as

\[ df(L_2) = \frac{\Gamma(58)}{\Gamma(54)\,\Gamma(4)}\,L_2^{53}(1-L_2)^{3}\,dL_2 \tag{69} \]

The mean of (69) is .9310 and the standard error* is .0330, so that the observed
value differs from the mean by more than seven times the standard error. By actually
integrating (69) it is found that $P(L_2 \le .6896) = .0000019$. $H_2$ must clearly be
rejected. To discover whether this is due to significant differences in mean strength
or in mean hardness or in both, we must consider the two single-variable problems

* For (46): Mean $L_2 = \dfrac{N-k-1}{N-2}$, $\sigma_{L_2} = \dfrac{1}{N-2}\sqrt{\dfrac{(N-k-1)(k-1)}{N-1}}$.















Egon S. Pearson and S. S. Wilks 371

TABLE III.

Strength (x) and Hardness (y) in Aluminium Die-Castings. Tests of $H_2$.

                   Degrees of    Sums of               Sums of               Sums of               Generalised
                   Freedom       Squares (x)           Squares (y)           Products (x, y)       Variances
Between Samples    k - 1 = 4     N v_11m = 306.089     N v_22m = 662.77      N v_12m = 214.86
Within Samples     N - k = 55    N v_11a = 636.165     N v_22a = 7653.42     N v_12a = 1697.52     |v_ija| = 552.018
Totals                           N v_110 = 942.254     N v_220 = 8316.19     N v_120 = 1912.38     |v_ij0| = 1160.77

$L_2 = \lambda_{H_2}^{1/N} = \sqrt{|v_{ija}|/|v_{ij0}|} = .6896$,
$\eta_{xt}^2 = v_{11m}/v_{110} = .3248$, $\eta_{yt}^2 = v_{22m}/v_{220} = .0797$.

                           Strength (x)                              Hardness (y)
                           Estimates of Variance     log10 (est.)    Estimates of Variance     log10 (est.)
Between Samples            N v_11m/(k-1) = 76.522    1.883786        N v_22m/(k-1) = 165.69    2.219296
Within Samples             N v_11a/(N-k) = 11.566    1.063183        N v_22a/(N-k) = 139.15    2.143483
Difference                                            .820603                                   .075813
z = 1.15129 x Difference                              .9448                                     .0873


separately. Tests may be applied to the squared correlation ratios $\eta_{xt}^2$ and $\eta_{yt}^2$, or R. A. Fisher's $z$-transformation can be used. The necessary calculations are shown in Table III.

Using Woo's tables*, it is found that $\eta_{xt}^2$ is clearly significant while $\eta_{yt}^2$ is not. Alternatively, referring to Fisher's $z$-tables† with $n_1 = k - 1 = 4$, $n_2 = N - k = 55$, it is seen that the 5% point lies at about .47 and the 1% at about .65, showing, as before, that mean strength differs significantly from sample to sample but not mean hardness.
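The quantities used in the tests of $H_2$ can be recomputed from the sums of squares and products in Table III. The following is a sketch rather than the authors' own procedure: it assumes that the law (69) is a Beta curve with parameters $N-k-1$ and $k-1$ (which reproduces the printed mean .9310 and standard error .0330), and it evaluates the tail area by the binomial-sum identity for integer parameters instead of by tables. Function and variable names are illustrative.

```python
import math

# Sums of squares and products printed in Table III (N = 60, k = 5).
N, k = 60, 5
within = {"xx": 636.165, "yy": 7653.42, "xy": 1697.52}
between = {"xx": 306.089, "yy": 662.77, "xy": 214.86}
total = {key: within[key] + between[key] for key in within}

def gen_var(s, n):
    """Generalised variance |v_ij| from sums of squares and products."""
    return (s["xx"] * s["yy"] - s["xy"] ** 2) / n ** 2

L2 = math.sqrt(gen_var(within, N) / gen_var(total, N))

# Assumption: under H_2 the criterion follows a Beta(a, b) curve
# with a = N - k - 1 and b = k - 1.
a, b = N - k - 1, k - 1
mean_L2 = a / (a + b)
sd_L2 = math.sqrt(a * b / (a + b + 1)) / (a + b)

def beta_cdf_int(x, a, b):
    """P(L < x) for integer a, b, using the binomial-sum identity."""
    n = a + b - 1
    return sum(math.comb(n, j) * x ** j * (1 - x) ** (n - j)
               for j in range(a, n + 1))

tail = beta_cdf_int(L2, a, b)

# Squared correlation ratios and Fisher's z = (1/2) ln(variance ratio).
eta2_x = between["xx"] / total["xx"]
eta2_y = between["yy"] / total["yy"]
z_x = 0.5 * math.log((between["xx"] / (k - 1)) / (within["xx"] / (N - k)))
z_y = 0.5 * math.log((between["yy"] / (k - 1)) / (within["yy"] / (N - k)))
```

Comparing `z_x` and `z_y` with the tabled 5% and 1% points (about .47 and .65) gives the same verdict as the text: strength is significant, hardness is not.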

The limited amount of test records available would therefore suggest the following tentative conclusions:

(a) Within the samples the relationship between the two qualities is stable, and represented by $\sigma_x = 3.401 \times 10^3$ lb. per sq. in., $\sigma_y = 11.80$ in Rockwell's E, $r_{xy} = +.769$. (The first two values are the square roots of $Nv_{11a}/(N-k)$ and $Nv_{22a}/(N-k)$ respectively, and the last is the value of $r_a$ given in Table II.)

(b) While the variation in mean strength from sample to sample is imperfectly controlled, the variation in hardness appears no more than might be expected through chance.

From the practical point of view this is not an altogether satisfactory result and further investigation into the anomaly (b) would be necessary before hardness could be used with confidence as an index of strength.

* Tables for Statisticians and Biometricians, Part II, Table IV.

† Statistical Methods for Research Workers, Table VI.

(15) Example 2. Relation between Length and Breadth of Human Skulls.

The data consist of standard measurements of length and breadth of skull in millimetres obtained for 20 adult males from each of 30 different races or groups*, i.e. $N = 600$, $n_t = 20$, and $k = 30$. That there would be considerable inter-racial variation for mean length and breadth was obvious, but it seemed to be of interest to examine the hypothesis $H_1$, that is to say, to test the extent of inter-racial uniformity in the relationship of length to breadth. These characters appear sufficiently nearly normally distributed within a race for the normal-theory tests to be applicable. Length will be denoted by $x$ and breadth by $y$; a summary of the calculations is shown in Table IV. We find from these that

$|v_{ija}| = 656.369$,

$\frac{1}{k}\sum_{t=1}^{k}\log\{|v_{ijt}|\} = 2.644429$
$\log |v_{ija}| = 2.817148$
Difference $= \bar{1}.827281$
$\log L_1 = \tfrac{1}{2} \times$ difference $= \bar{1}.913640$, $L_1 = .8197$.

From (64) and (65) we obtain Mean $L_1 = M_1 = .923678$, $M_2 = .853317$, $\sigma_{L_1} = .0117$.

The observed $L_1$ is below the expected mean value, and the ratio (67) is $-8.9$. This is so clearly significant that, without further refinement in calculation, we can say that $H_1$ is untenable. We must now examine whether this lack of uniformity is present both in the group standard deviations $s_{xt}$ and $s_{yt}$, and in the correlations $r_{xyt}$.

For the first problem Neyman and Pearson's single variate test for $H_1$ may be applied†. This involves the calculation of the sums of the logarithms of the quantities $nv_{11t}$ and of $nv_{22t}$ given in Table IV, since in this case

$$L_1 = \prod_{t=1}^{k}\left(\frac{v_{iit}}{v_{iia}}\right)^{1/k} \qquad (70),$$

where $i = 1$ for length and $i = 2$ for breadth. The calculations are shown in Table V, $v_{11a}$ and $v_{22a}$ being obtained from Table IV. It is found that were $H_1$ true, then:

Mean $L_1 = .9496$, $\sigma_{L_1} = .0129$.

* We are indebted to Dr G. M. Morant for providing us with the necessary sources of information.

† For an illustration of the use of this test, see Biometrika, Vol. xxiv, p. 410.



TABLE IV.

Length (x) and Breadth (y) of Skulls. Tests of $H_1$; data for separate samples.

(The column of correlation coefficients $r_{xyt}$ is not legible in this copy and is omitted.)

No.  Race or group                               n v_11t     n v_22t     n v_12t     |v_ijt|    Mean x    Mean y
 1   Aëtas                                        612.00      300.14       41.50      454.91    172.00    143.92
 2   Aleuts                                       218.20      391.20       57.80      205.05    183.30    149.20
 3   Andamanese                                   216.45      311.25      149.50      112.55    167.95    136.25
 4   Anglo-Saxons                                 975.44      777.45      -12.88     1895.47    191.62    142.45
 5   Armenians                                    551.20      320.20       72.80      427.99    176.80    145.70
 6   Australians. (South Australia)              1114.20      280.95      427.30      326.12    195.70    127.05
 7   Australians. (Victoria)                     1131.20      472.95      -27.20     1335.65    182.80    132.45
 8   Copts                                        722.20      558.55      423.90      559.23    174.30    135.85
 9   Dayaks. (Borneo)                             431.50      881.24        8.50      950.46    174.50    138.97
10   Easter Islanders                             834.74      331.64      -59.16      683.33    191.72    133.42
11   Egyptians, 1st Dynasty. (Abydos)             518.71      319.07       89.03      393.95    185.62    136.50
12   Egyptians, Modern. (Cairo)                  1115.45      382.30      -99.28     1041.45    186.86    133.90
13   Egyptians, Predynastic. (Badari)             542.05      356.64      140.23      434.13    184.65    131.42
14   English, Mediaeval. (Hythe)                  714.95      748.14      227.08     1208.29    179.45    148.07
15   English, Roman. (Spitalfields)              1021.80      550.75       14.25     1406.38    181.10    143.75
16   English, 17th century. (Farringdon St)       697.80      312.24      207.80      436.75    192.10    145.97
17   Eskimos. (Greenland)                         325.75      278.95      123.75      188.88    192.75    133.55
18   Guanche. (Canary Islands)                    327.75      158.55      215.75       13.54    189.25    141.65
19   Hindus. (Bengal)                             650.55      700.55       11.95     1139.00    176.85    131.65
20   Indonesians. (Ceram)                         887.75     1033.80      153.50     2235.48    177.25    134.90
21   Javanese                                    1018.14      392.14      112.61      966.43    171.57    141.42
22   Mongols. (Urga)                              488.95      212.20      245.70      108.47    189.05    145.30
23   Moriori. (Chatham Islands)                   232.45      488.24       33.97      280.84    186.05    142.27
24   Negroes, Teita. (Kenya Colony)               562.74      368.50      204.50      413.87    187.72    130.00
25   Papuans. (New Guinea)                        404.80      448.95      104.60      426.98    188.40    132.05
26   Tagals. (Philippine Islands)                 454.64      303.20      133.70      299.93    178.92    138.80
27   Tasmanians                                   698.00      237.75      241.00      269.67    190.00    139.25
28   Turks                                        381.80      393.20     -199.60      275.71    168.10    141.80
29   Swiss. (Munster)                             405.75      587.20       11.00      595.34    177.75    151.20
30   Venezuelans                                  480.00      239.20       86.00      268.55    178.00    140.80

     Totals                                     18736.96 = N v_11a
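The bivariate test of $H_1$ for the skull data can be retraced from the sums of squares and products of Table IV. The sketch below is offered as a check only. The thirty triplets are the values as read from the table, with the Anglo-Saxon sum of squares for length taken as 975.44, the figure that reproduces the printed generalised variance 1895.47; the mean and standard error of $L_1$ are those quoted in the text from (64) and (65). Names such as `ROWS` and `gen_var` are illustrative.

```python
import math

# (n*v_11t, n*v_22t, n*v_12t) for the 30 groups, in the order of Table IV.
ROWS = [
    (612.00, 300.14, 41.50), (218.20, 391.20, 57.80), (216.45, 311.25, 149.50),
    (975.44, 777.45, -12.88), (551.20, 320.20, 72.80), (1114.20, 280.95, 427.30),
    (1131.20, 472.95, -27.20), (722.20, 558.55, 423.90), (431.50, 881.24, 8.50),
    (834.74, 331.64, -59.16), (518.71, 319.07, 89.03), (1115.45, 382.30, -99.28),
    (542.05, 356.64, 140.23), (714.95, 748.14, 227.08), (1021.80, 550.75, 14.25),
    (697.80, 312.24, 207.80), (325.75, 278.95, 123.75), (327.75, 158.55, 215.75),
    (650.55, 700.55, 11.95), (887.75, 1033.80, 153.50), (1018.14, 392.14, 112.61),
    (488.95, 212.20, 245.70), (232.45, 488.24, 33.97), (562.74, 368.50, 204.50),
    (404.80, 448.95, 104.60), (454.64, 303.20, 133.70), (698.00, 237.75, 241.00),
    (381.80, 393.20, -199.60), (405.75, 587.20, 11.00), (480.00, 239.20, 86.00),
]
n, k = 20, len(ROWS)
N = n * k

def gen_var(sxx, syy, sxy, size):
    """Generalised variance from sums of squares and products of `size` items."""
    return (sxx * syy - sxy ** 2) / size ** 2

# (1/k) * sum of log10 |v_ijt| over the separate samples.
mean_log = sum(math.log10(gen_var(*row, n)) for row in ROWS) / k

# Pooled generalised variance |v_ija| from the column totals.
pooled = gen_var(*(sum(col) for col in zip(*ROWS)), N)

log_L1 = 0.5 * (mean_log - math.log10(pooled))
L1 = 10 ** log_L1

# Ratio (67), using the mean and standard error quoted in the text.
ratio = (L1 - 0.923678) / 0.0117
```

The pooled determinant and the criterion come out at the printed values 656.369 and .8197, which also serves as a check on the transcription of the table.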












Consequently we have

                                            Length (x)    Breadth (y)
Observed $L_1$                                 .9000         .9074
$(L_1 - \text{Mean } L_1)/\sigma_{L_1}$        -3.84         -3.27

TABLE V.

Length and Breadth of Skulls. Tests of $H_1$ (Single-variate).

                                               Length (x)      Breadth (y)
                                                 i = 1           i = 2
$\frac{1}{k}\sum_t \log(20\,v_{iit})$           2.749829        2.599191
$\log 20$                                       1.301030        1.301030
$\frac{1}{k}\sum_t \log(v_{iit})$               1.448799        1.298161
$\log(v_{iia})$                                 1.494548        1.340349
$\log L_1$                                      $\bar{1}.954251$    $\bar{1}.957812$
$L_1$                                            .9000           .9074
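The entries of Table V follow from one another by simple subtraction of logarithms. A short sketch, using only the printed figures and the mean and standard error of $L_1$ quoted in the text (the dictionary layout is an illustrative choice):

```python
import math

n = 20                          # skulls per group
mean_L1, sd_L1 = 0.9496, 0.0129  # values quoted for H_1 true

# (1/k) * sum of log10(20 * v_iit), and log10(v_iia), from Table V.
table_v = {
    "length": (2.749829, 1.494548),
    "breadth": (2.599191, 1.340349),
}

results = {}
for name, (mean_log_nv, log_va) in table_v.items():
    # Remove log 20 to get (1/k) * sum log v_iit, then subtract log v_iia.
    log_L1 = (mean_log_nv - math.log10(n)) - log_va
    L1 = 10 ** log_L1
    results[name] = (L1, (L1 - mean_L1) / sd_L1)
```

The two ratios reproduce the figures $-3.84$ and $-3.27$ given above the table.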


The divergence shown by the ratios is significant, and it does not seem necessary to enter here into the approximate calculation of the probabilities $P(L_1 < .9000)$ and $P(L_1 < .9074)$ (which are both under .01), since examples have been discussed elsewhere and it is hoped to publish shortly convenient tables for use with the test.

We must now examine the variation among the 30 correlation coefficients $r_{xyt}$ ($t = 1, 2, \ldots 30$); the best method of procedure is probably as follows:

If $x$ and $y$ are normally correlated it is known that in repeated samples of $n$

$$z' = \tfrac{1}{2}\{\log_e(1+r) - \log_e(1-r)\} \qquad (71)$$

is approximately normally distributed with a standard error of $1/\sqrt{n-3}$*. Consequently, we may test whether $k$ independent values of $r$ differ only through chance fluctuations from some unknown population value of $\rho$, by calculating

$$\chi^2 = \sum_{t=1}^{k}\{(n_t-3)(z_t'-\bar{z}')^2\} \qquad (72),$$

where $\bar{z}' = \Sigma(z_t'/k)$, and entering the $(\chi^2, P)$ tables with $k - 1$ degrees of freedom (i.e. $n' = k$ in the notation of Elderton's Table). In the present instance it is found that $\chi^2 = 96.01$, while $n' = 30$, which is evidently significant. $\sqrt{2\chi^2}$ deviates from the expected value by about 6.3 times the standard error†.
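The test of (71) and (72) is easily put in computational form. The sketch below is illustrative: the function names are not the authors', the four correlations in `demo` are hypothetical values chosen only to exercise the formula, and the large-sample rule applied to $\chi^2 = 96.01$ is the one stated in the footnote to this passage.

```python
import math

def correlation_homogeneity_chi2(rs, ns):
    """Chi-square of (72) for correlations rs from samples of sizes ns."""
    zs = [0.5 * (math.log(1 + r) - math.log(1 - r)) for r in rs]  # (71)
    z_bar = sum(zs) / len(zs)      # unweighted mean, as defined in the text
    return sum((n - 3) * (z - z_bar) ** 2 for z, n in zip(zs, ns))

def large_f_deviate(chi2, f):
    """sqrt(2*chi2) - sqrt(2f - 1), roughly a unit normal deviate for large f."""
    return math.sqrt(2 * chi2) - math.sqrt(2 * f - 1)

# Hypothetical correlations, for illustration only (not taken from Table IV).
demo = correlation_homogeneity_chi2([0.10, 0.55, -0.20, 0.30], [20, 20, 20, 20])

# The value reported in the text: chi2 = 96.01 with k - 1 = 29 degrees of freedom.
deviate = large_f_deviate(96.01, 29)
```

With the reported $\chi^2$ the deviate is close to 6.3, as stated.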

* R. A. Fisher: Metron, Vol. i. No. iv. p. 18. An illustration of using this test with $k$ values of $r$ has been given by L. H. C. Tippett: The Methods of Statistics (1931), p. 148.

† This result is obtained by using the rule that when $f = n' - 1$ is large, $\sqrt{2\chi^2} - \sqrt{2f-1}$ is approximately normally distributed about zero with unit standard error.

We have found therefore that the covariation in length and breadth of skull within a race cannot be considered as uniform from race to race; further, that while the standard deviations certainly differ significantly, the lack of uniformity is due in much greater degree to the instability of the correlation coefficient. Having regard to the great variety in the data, these results were to be expected since a "race" is a loosely defined term, and the coefficient of correlation between two measures of size within a group will depend upon the homogeneity of that group. The more similar the skulls are in shape the higher is $r$ likely to be. In considering the possible value of the criterion $L_1$ in anthropometric work, it should be remembered that, although the present paper is concerned only with the case of two correlated variables, the general theory developed by one of us* is applicable in the case of any number of variables.

Although the difference in mean length and breadth for the different races is so obvious that a statistical test is hardly required, it may be useful to summarise what would be the formal method of approach:

(1) Considering the two variables together it is found that $H_1$ ($x$ and $y$) is quite untenable; therefore we should not proceed to test $H_2$ ($x$ and $y$).

(2) $H_1(x)$ and $H_1(y)$ are also improbable, but the differences in the standard deviations $s_x$ and $s_y$ are hardly sufficient to invalidate the tests $H_2(x)$ and $H_2(y)$.

(3) From two tables for analysis of variance similar to those contained in Table III, it is found that $\eta_{xt}^2 = .6489$ and $\eta_{yt}^2 = .6294$. If $H_2$ were true for the case of a single variable, then Mean $\eta^2 = .0484$, and $\sigma_{\eta^2} = .0124$. Clearly, therefore, $H_2(x)$ and $H_2(y)$ are untenable, that is, the 30 samples of 20 provide convincing evidence that the racial mean characters differ significantly.
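The mean and standard error of $\eta^2$ quoted in (3) can be recovered on the usual normal-theory assumption, supplied here and not stated in the text, that under $H_2$ the squared correlation ratio follows a Beta curve with parameters $\tfrac{1}{2}(k-1)$ and $\tfrac{1}{2}(N-k)$. A minimal sketch with illustrative names:

```python
import math

N, k = 600, 30

# Assumed Beta parameters for eta^2 when H_2 is true.
a, b = (k - 1) / 2, (N - k) / 2

mean_eta2 = a / (a + b)                      # equals (k - 1)/(N - 1)
sd_eta2 = math.sqrt(a * b / (a + b + 1)) / (a + b)

observed = {"length": 0.6489, "breadth": 0.6294}
ratios = {name: (value - mean_eta2) / sd_eta2
          for name, value in observed.items()}
```

On this assumption both observed values lie some forty or more standard errors above expectation, which is the sense in which $H_2(x)$ and $H_2(y)$ are untenable.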

VI. Conclusion.

Certain general methods of analysis of multivariate data have been developed by one of us elsewhere. In the present paper the special case of two correlated variables has been taken, in order to illustrate (a) the process of reasoning underlying the methods, (b) the practical application of the resulting tests, (c) their relation to other tests in use. The following points may be emphasised:

(1) It is necessary to recognise that in many problems, hypotheses of the $H_1$ type need to be tested, as well as those of the $H_2$ type. The technique of Analysis of Variance does not appear suited to deal with the former when more than two samples are concerned.

(2) In the multivariate problem it would be possible to deal with the variation of each character and the correlation of each pair of characters separately, but the application in the first instance of a single comprehensive test has several advantages. If, for example, on the evidence available $H$ can be accepted, it is unnecessary to proceed to test $H_1$ and $H_2$. Similarly, if $H_1$ (using $p$ variates) can be accepted there should be no need to proceed to the $p$ single-variate $H_1$ tests and the $\tfrac{1}{2}p(p-1)$ correlation tests. The same situation arises in dealing with $H_2$. Even when the comprehensive test is not satisfied, and it is necessary to apply the separate tests in order to locate the source of disturbance, relatively little labour will have been wasted in applying the comprehensive test first. For example, in the case of two variables which has been illustrated, the calculation of $\log|v_{ijt}|$ needed to test $H_1$ (two variables) involves little extra work when once $nv_{11t}$, $nv_{22t}$ and $nv_{12t}$ have been computed. But the latter quantities are in any case required if the sample variances and correlations are to be considered separately.

(3) The methods suggested for calculating the significance of a given value of the $\lambda$ or $L$ criteria are admittedly not in final form. For convenience in practical working, tables to be entered with $n$ and $k$ are needed, which would show certain levels of significance of these criteria. The possibility of forming such tables is under consideration.

(4) It has been assumed throughout that the variables are normally distributed. Some investigations on the stringency of this assumption are in progress.

* Wilks, loc. cit.


VII. Appendix.

To assist in the interpretation of $\lambda_H$, $\lambda_{H_1}$ and $\lambda_{H_2}$ as criteria for testing the corresponding hypotheses $H$, $H_1$ and $H_2$ we shall find it convenient to prove the following theorems.

Theorem I. Let $\eta_{ik}$ ($i = 1, 2$; $k = 1, 2, \ldots m$) be any set of real numbers, and let the matrix

$$\begin{Vmatrix} A_{11} & A_{12} \\ A_{21} & A_{22} \end{Vmatrix}$$

be real and positive definite with $A_{12} = A_{21}$ and

$$\psi = \frac{\begin{vmatrix} A_{11} & A_{12} \\ A_{21} & A_{22} \end{vmatrix}}{\begin{vmatrix} A_{11}+\Sigma\eta_{1k}^2 & A_{12}+\Sigma\eta_{1k}\eta_{2k} \\ A_{21}+\Sigma\eta_{1k}\eta_{2k} & A_{22}+\Sigma\eta_{2k}^2 \end{vmatrix}},$$

where the moment products of $\eta$'s are summed for $k$ from 1 to $m$; then

(a) $0 \le \psi \le 1$,

(b) a necessary and sufficient condition that $\psi = 1$, when $|A_{ij}| > 0$, is that $\eta_{ik} = 0$ ($i = 1, 2$; $k = 1, 2, \ldots m$),

(c) a necessary and sufficient condition that $\psi = 0$ when $|A_{ij}| > 0$, is that at least one of the $\eta$'s be infinite.

Proof: Let the determinants in the numerator and denominator of $\psi$ be called $A$ and $B$ respectively. Then (a) can be shown at once by induction, for suppose $B \ge A$ for $k = t$, then for $k = t + 1$, $B$ can be written as $B_t + q_{t+1}$, where $B_t$ is the value of $B$ for $k = t$ and $q_{t+1}$ is a positive definite quadratic form in $\eta_{1,t+1}$ and $\eta_{2,t+1}$. Therefore, setting $t = 0, 1, 2, \ldots m$, we get

$$B_m \ge \ldots \ge B_{t+1} \ge B_t \ge \ldots \ge B_0 = A > 0,$$

which is equivalent to the proof of (a).

The sufficient conditions in (b) and (c) are obvious. To prove the necessary condition of (b) we observe that for $B_{t+1} = B_t$ it is necessary that $\eta_{1,t+1} = \eta_{2,t+1} = 0$ since $q_{t+1}$ is positive definite in these two variables. Setting $t = 0, 1, 2, \ldots m$, we see that a necessary condition for

$$B_m = \ldots = B_{t+1} = B_t = \ldots = A,$$

or $\psi = 1$, is that $\eta_{ik} = 0$ ($i = 1, 2$; $k = 1, 2, \ldots m$).

To prove the necessary condition in (c), we note that at least one member of the non-decreasing set $B_m, \ldots, B_{t+1}, B_t, \ldots, B_1$ must become infinite. Let $B_{t+1}$ be the first $B$ which is infinite, then clearly, $\ldots, B_{t+3}, B_{t+2}$ will also be infinite. But $B_{t+1} = B_t + q_{t+1}$, where $B_t$ is finite. Therefore $q_{t+1}$ must be positively infinite, which can occur only when at least one of the numbers $\eta_{1,t+1}$ and $\eta_{2,t+1}$ is numerically infinite.

The proof of this theorem for the case when $A$ and $B$ are determinants of the $n$-th order can be carried out in essentially the same way as the one just given for two variables.
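Theorem I lends itself to a simple numerical illustration. In the sketch below the matrix `A` and the random $\eta$'s are hypothetical, and `psi` is an illustrative name for the ratio of the two determinants; the code does not prove anything, it merely exhibits the three statements (a), (b) and (c) on particular numbers.

```python
import random

def det2(m):
    """Determinant of a 2x2 matrix given as nested lists."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def psi(A, etas):
    """Ratio |A| / |A + sum_k eta_k eta_k^T| for a 2x2 matrix A."""
    B = [row[:] for row in A]
    for e1, e2 in etas:
        B[0][0] += e1 * e1
        B[0][1] += e1 * e2
        B[1][0] += e1 * e2
        B[1][1] += e2 * e2
    return det2(A) / det2(B)

A = [[2.0, 0.5], [0.5, 1.0]]        # hypothetical positive definite matrix
rng = random.Random(0)               # fixed seed so the run is repeatable

# (a): the ratio stays between 0 and 1 for arbitrary real eta's.
values = [psi(A, [(rng.uniform(-3, 3), rng.uniform(-3, 3)) for _ in range(4)])
          for _ in range(1000)]

# (b): all eta's zero gives exactly 1.
at_zero = psi(A, [(0.0, 0.0)] * 3)

# (c): a very large eta drives the ratio towards 0.
near_zero = psi(A, [(1e6, 0.0)])
```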


Theorem II. Let $a_{ijt}$ ($i, j = 1, 2$; $t = 1, 2, \ldots k$) be any set of real numbers in which $a_{ijt} = a_{jit}$ for $i \ne j$, and such that the matrix $\|a_{ijt}\|$ is positive definite, and let $A = |A_{ij}|$ where

$$A_{ij} = \sum_{t=1}^{k} p_t a_{ijt} \qquad \text{(i)},$$

and the $p$'s are positive such that $\Sigma p_t = 1$. Then, if

$$\psi = \frac{\prod_{t=1}^{k}|a_{ijt}|^{p_t}}{A} \qquad \text{(ii)},$$

we have

(a) $0 \le \psi \le 1$,

(b) a necessary and sufficient condition that $\psi = 1$ when $|a_{ijt}| > 0$ ($t = 1, 2, \ldots k$) is that $a_{ijt} = a_{ijt'}$ ($i, j = 1, 2$; $t, t' = 1, 2, \ldots k$); that is, that the matrices $\|a_{ijt}\|$ be identical.

Proof: For the one-variable case, that is, when $i = j = 1$, the theorem simply states that the weighted geometric mean of a set of positive numbers cannot exceed the weighted arithmetic mean of the set, and that the means can be equal only when all of the numbers are equal. The proof for this case can be found in a number of advanced algebra text-books and will be assumed.

The sufficient condition in (b) is obvious. Thus, let us consider the necessary condition. For convenience let

$$\phi = A - G \qquad \text{(iii)},$$

where $G = \prod_{t=1}^{k} d_t^{\,p_t}$ and $d_t = |a_{ijt}|$.

Then the theorem reduces to the problem of showing that $\phi \ge 0$, where the equality will hold only when the matrices $\|a_{ijt}\|$ are identical. Consider the minimum of $\phi$ for variations of the $a$'s. If it exists, it will be given by the equations ($t = 1, 2, \ldots k$),

$$\frac{\partial\phi}{\partial a_{11t}} = p_t\left(A_{22} - \frac{G\,a_{22t}}{d_t}\right) = 0 \qquad \text{(iv)},$$

$$\frac{\partial\phi}{\partial a_{22t}} = p_t\left(A_{11} - \frac{G\,a_{11t}}{d_t}\right) = 0 \qquad \text{(v)},$$

$$\frac{\partial\phi}{\partial a_{12t}} = -2p_t\left(A_{12} - \frac{G\,a_{12t}}{d_t}\right) = 0 \qquad \text{(vi)}.$$

By a straightforward combination of the first equation for all values of $t$,

$$A_{22} = \prod_{t=1}^{k} a_{22t}^{\,p_t} \qquad \text{(vii)}.$$

Similarly, with respect to (v),

$$A_{11} = \prod_{t=1}^{k} a_{11t}^{\,p_t} \qquad \text{(viii)}.$$

From (vi)

$$A_{12}\,d_t = G\,a_{12t} \quad (t = 1, 2, \ldots k) \qquad \text{(ix)}.$$

But from the case of a single variable (vii) and (viii) can hold only when

$$a_{11t} = a_{11t'}, \quad a_{22t} = a_{22t'} \quad (t, t' = 1, 2, \ldots k) \qquad \text{(x)}.$$

Call their common values $a_{11}$ and $a_{22}$ respectively.

Placing these values in (ix) which must hold for all values of $a_{11}$ and $a_{22}$, we get at once that

$$a_{12t} = a_{12t'} \quad (t, t' = 1, 2, \ldots k) \qquad \text{(xi)}.$$

Let the common value be $a_{12}$. Therefore (iv), (v) and (vi) are satisfied only when the matrices $\|a_{ijt}\|$ are identical. The matrix of second order derivatives of $\phi$ with respect to the $a_{ijt}$ can be shown to be positive definite when (iv), (v) and (vi) are satisfied, provided $\|a_{ij}\|$ is positive definite. Thus, $\phi$ has a true minimum, and since the minimum is zero, and $G$ is positive, (a) follows at once. The generalisation of this theorem to the case where $i, j = 1, 2, \ldots n$ is straightforward.
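Theorem II may likewise be tried on numbers. The sketch below builds hypothetical positive definite matrices (the helper `random_pd` and the weights are illustrative choices) and compares the weighted geometric mean of their determinants with the determinant of their weighted arithmetic mean; it is a numerical illustration, not a substitute for the proof.

```python
import random

def det2(a11, a12, a22):
    """Determinant of the symmetric 2x2 matrix [[a11, a12], [a12, a22]]."""
    return a11 * a22 - a12 * a12

def psi(mats, weights):
    """prod_t |a_ijt|^{p_t} divided by |sum_t p_t a_ijt|."""
    geometric = 1.0
    for (a11, a12, a22), p in zip(mats, weights):
        geometric *= det2(a11, a12, a22) ** p
    mean = [sum(p * m[i] for m, p in zip(mats, weights)) for i in range(3)]
    return geometric / det2(*mean)

def random_pd(rng):
    """Hypothetical helper: a random symmetric positive definite 2x2 matrix."""
    u, v, w = rng.uniform(0.5, 2), rng.uniform(-1, 1), rng.uniform(0.5, 2)
    return (u * u, u * v, v * v + w * w)   # L L^T with L = [[u, 0], [v, w]]

rng = random.Random(1)                      # fixed seed for repeatability
weights = [0.2, 0.3, 0.5]                   # positive p's summing to unity

# (a): the ratio never exceeds 1 for unequal matrices.
values = [psi([random_pd(rng) for _ in weights], weights) for _ in range(1000)]

# (b): identical matrices give the value 1.
same = random_pd(rng)
at_equality = psi([same] * 3, weights)
```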



ON A METHOD OF DETERMINING WHETHER A SAMPLE 
OF SIZE n SUPPOSED TO HAVE BEEN DRAWN FROM A 
PARENT POPULATION HAVING A KNOWN PROBABILITY 
INTEGRAL HAS PROBABLY BEEN DRAWN AT RANDOM. 

By KARL PEARSON. 

Probability integrals have now been tabled for the following curves, all of which occur frequently in statistics, either as accurate or approximate distributions of statistical quantities:

(i) $y = y_0 e^{-\frac{1}{2}x^2/\sigma^2}$, the "normal" curve (probability integral provided by Sheppard's Tables*).

(ii) $y = y_0 x^{p-1}(1-x)^{q-1}$ (Pearson's Type I, Tables of the Incomplete B-Function†).

(iii) $y = y_0\left(1 - \dfrac{x^2}{a^2}\right)^{m}$ (Pearson's Type II, ditto).

(iv) $y = y_0\left(1 + \dfrac{x^2}{a^2}\right)^{-m}$ (Pearson's Type VII, "Student's" $z$-Curve, Tables of the Incomplete B-Function).

(v) $y = y_0 e^{-\gamma x}\left(1 + \dfrac{x}{a}\right)^{p}$ (Pearson's Type III, Tables of the Incomplete Γ-Function).

(vi) $y = y_0 (x - a)^{m_2}/x^{m_1}$ (Pearson's Type VI, by transformation to the Incomplete B-function).

(vii) $y = y_0 e^{-\gamma/x} x^{-p}$ (Pearson's Type V, by transformation to the Incomplete Γ-function).

(viii) $y = y_0 e^{-x/\sigma}$ (Pearson's Type X, Newman and Glaisher's tables‡ of the Exponential Function).

In these and a number of other cases the probability integral can be found or easily computed. The probability integral of the $T_\nu(x)$ Bessel function curve has also been provided by Miss F. N. David§. The sole outstanding case among the Pearson curves is Type IV, $y = y_0 e^{-\nu\tan^{-1}(x/a)}\Big/\left(1 + \dfrac{x^2}{a^2}\right)^{m}$, where a table of

$$\int e^{-\nu\theta}\cos^{n}\theta\,d\theta$$

* Biometrika, Vol. ii. pp. 174-190, or Tables for Statisticians, Part I. pp. 1-10.

† Published by Biometrika, 1933.

‡ Camb. Phil. Soc. Trans. Vol. xiii. Part iii. pp. 148-272.

§ Biometrika, Vol. xxiv. pp. 344-346.

380 General Criterion for Random Sampling 


for values of $\nu$ and $n$ is required. By aid of these probability integral tables any single value of a variate $x_s$ supposed to belong to one of these curves can have its probability integral $p_s$ determined, which measures the frequency of values arising as great as or greater than $x_s$. Let us suppose a sample of size $n$ containing the variates $x_1, x_2, x_3, \ldots x_s, \ldots x_n$ to be drawn from a distribution $y = \phi(x)$ and let $p_1, p_2, p_3, \ldots p_s, \ldots p_n$ be their respective probability integrals. What will be the distribution of these probability integrals?

Now $p$ has a definite value for each value of $x$ and by definition of the probability integral

$$p = \int_a^x \phi(x)\,dx \qquad (1),$$

where $a$ is the start of $\phi(x)$ and the whole area $\int_a \phi(x)\,dx = 1$.

Now the frequency with which $p_s$ occurs may be represented by $F(p)\,dp$, but this is the same as the frequency of $x$, $= \phi(x)\,dx$, or

$$F(p)\,dp = \phi(x)\,dx,$$

but by (1) $dp = \phi(x)\,dx$.

Accordingly $F(p) = 1$, or the distribution for the probability integrals of any frequency curve is a rectangle on the base $p = 0$ to $p = 1$*.
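The argument that $F(p) = 1$ can be illustrated on any curve whose probability integral is known in closed form. The sketch below takes a Type X (exponential) curve with a hypothetical scale `SIGMA` and checks numerically that $\phi(x)$ divided by $dp/dx$ is unity at several points; the derivative is estimated by a central difference, so the agreement is to rounding error only.

```python
import math

SIGMA = 2.0  # hypothetical scale of a Type X (exponential) curve

def phi(x):
    """Frequency curve y = (1/sigma) * exp(-x/sigma), starting at a = 0."""
    return math.exp(-x / SIGMA) / SIGMA

def prob_integral(x):
    """p = integral of phi from 0 to x."""
    return 1.0 - math.exp(-x / SIGMA)

def density_of_p(x, h=1e-6):
    """F(p) = phi(x) / (dp/dx), with dp/dx from a central difference."""
    dp_dx = (prob_integral(x + h) - prob_integral(x - h)) / (2 * h)
    return phi(x) / dp_dx

checks = [density_of_p(x) for x in (0.1, 1.0, 5.0, 12.0)]
```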

We note that this is not a discrete but a perfectly continuous distribution of frequency. Since all the $x$'s are supposed to be obtained by random sampling, all the $p$'s will likewise be a random sampling distribution, and $P_{1\ldots n}$, the probability of that random sample occurring or one with the individual variates having a greater probability, will be given by

$$P_{1\ldots n} = p_1 p_2 \ldots p_s \ldots p_n = \prod_{s=1}^{n}(p_s).$$

We now ask: Is this an improbable sample? In other words we enquire: What is the probability of a sample with as great as or a greater value than $P_{1\ldots n}$ occurring? We need to find out the sum of all the samples with a probability $P \ge P_{1\ldots n}$†.


* Bayes' hypothesis of the rectangular distribution of probabilities, although it clearly does not apply to all probabilities, certainly does apply to all probability integral values.

† The reader must assure himself at this point that we are adopting methods of treatment with which he is already familiar. That is to say, we are extending the argument he is accustomed to use with regard to the probability integral for a single value $x_s$; for if $p_s$ be extremely small, he argues that it is reasonable to suppose that $x_s$ was not a random sample from the curve he has used to compute its probability integral. Now $p_1$ is the chance that a certain variate $x_1$ will not exceed a value $\xi_1$, $p_2$ that a second variate $x_2$ will not exceed a value $\xi_2$ and so on. Then, since $x_1, x_2, \ldots x_n$ are supposed to be independent random samples it follows that $P_{1\ldots n} = p_1 p_2 \ldots p_n$ is the chance that the combined system $x_1, x_2, \ldots x_n$ will not exceed the system $\xi_1, \xi_2, \ldots \xi_n$. We then turn the problem round so to speak and ask what other series of $x_1, x_2, \ldots x_n$ will give a probability exceeding $P_{1\ldots n}$. That is to say we divide up the possible field of $p_1, p_2, \ldots p_n$'s by a contour surface which separates the sub-field of [...] combined from the sub-field which gives greater probabilities. [...] the less probabilities be very small, then we judge that it is unreasonable to suppose the observed $x_1, x_2, \ldots x_n$ form a random sample from the system or systems of distribution



Karl Pearson 


381 


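To answer this we must first solve the following problem:

We have a system of $n$ rectangular axes in Euclidean space and a point $z_1, z_2, \ldots z_n$. This point is constrained to lie in the "$n$-cuboid" with all its edges equal to unity. A surface is given by the equation

$$\lambda_n = z_1 z_2 \ldots z_n \qquad (1)\ bis,$$

which may be termed the "$n$-hyperboloid"; we require to find out what volume of the $n$-cuboid is cut off by the $n$-hyperboloid.

Let us denote the volume of the $n$-cuboid inside the hyperboloid by $V_n$. Let us assume that

$$V_n = 1 - \lambda_n\left(1 - \frac{\log_e\lambda_n}{1!} + \frac{(\log_e\lambda_n)^2}{2!} - \frac{(\log_e\lambda_n)^3}{3!} + \ldots + (-1)^{n-1}\frac{(\log_e\lambda_n)^{n-1}}{(n-1)!}\right) \qquad (2).$$

Now add another variate $z_{n+1}$, so that

$$z_1 z_2 \ldots z_n z_{n+1} = \lambda_{n+1}.$$

If $z_{n+1}$ be made constant the section of the $(n+1)$-hyperboloid is an $n$-hyperboloid and the "volume" of this is given by (2) above. We must integrate these strips of "volume" from $z_{n+1} = \lambda_{n+1}$ to 1 and accordingly

$$V_{n+1} = \int_{\lambda_{n+1}}^{1} V_n\,dz_{n+1} = \int_{\lambda_{n+1}}^{1}\left[1 - \frac{\lambda_{n+1}}{z_{n+1}}\left(1 - \frac{\log_e\frac{\lambda_{n+1}}{z_{n+1}}}{1!} + \frac{\left(\log_e\frac{\lambda_{n+1}}{z_{n+1}}\right)^2}{2!} - \ldots + (-1)^{n-1}\frac{\left(\log_e\frac{\lambda_{n+1}}{z_{n+1}}\right)^{n-1}}{(n-1)!}\right)\right]dz_{n+1}.$$

Suppose $y = \log_e\dfrac{\lambda_{n+1}}{z_{n+1}}$, then

$$V_{n+1} = \Big[z_{n+1}\Big]_{\lambda_{n+1}}^{1} + \lambda_{n+1}\int_0^{\log_e\lambda_{n+1}}\left(1 - \frac{y}{1!} + \frac{y^2}{2!} - \ldots + (-1)^{n-1}\frac{y^{n-1}}{(n-1)!}\right)dy$$

$$= 1 - \lambda_{n+1} + \lambda_{n+1}\left(\frac{\log_e\lambda_{n+1}}{1!} - \frac{(\log_e\lambda_{n+1})^2}{2!} + \frac{(\log_e\lambda_{n+1})^3}{3!} - \ldots + (-1)^{n-1}\frac{(\log_e\lambda_{n+1})^{n}}{n!}\right)$$

$$= 1 - \lambda_{n+1}\left(1 - \frac{\log_e\lambda_{n+1}}{1!} + \frac{(\log_e\lambda_{n+1})^2}{2!} - \ldots + (-1)^{n}\frac{(\log_e\lambda_{n+1})^{n}}{n!}\right),$$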


upon which we have based the calculation of their probability integrals. We do not assert that our test is the most stringent test, but that it is a very general and often easily applied systematic test of a given hypothesis. It may, indeed, be doubted whether any test is the most stringent throughout the whole area in which it has become customary to apply it. We may say of most tests, as of the present, that they may disprove any hypothesis, but that if they merely render a reasonable probability for the hypothesis, we cannot be certain that a more stringent test may not exist which would render the hypothesis very unlikely, or that a second hypothesis may not have a still greater probability.



or, if (2) holds for $V_n$, it will hold for $V_{n+1}$. But for $n = 2$, $\lambda_2 = z_1 z_2$, and clearly

$$V_2 = 1 - \lambda_2(1 - \log_e\lambda_2).$$

Hence the formula holds for $V_2$ and will therefore hold for $V_3$ and so on by induction, and is thus generally true.
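The inductive step can be checked numerically by integrating the closed form for $V_n$ in strips, exactly as described, and comparing the result with the closed form for $V_{n+1}$. The sketch below uses a plain midpoint rule; the function names, the number of strips and the trial value $\lambda = 0.3$ are illustrative choices.

```python
import math

def v_closed(n, lam):
    """Formula (2): 1 - lam * sum over j < n of (-ln lam)^j / j!."""
    L = -math.log(lam)
    return 1.0 - lam * sum(L ** j / math.factorial(j) for j in range(n))

def v_next_numeric(n, lam, steps=20000):
    """V_{n+1}(lam) as the integral of V_n(lam / z) for z from lam to 1."""
    h = (1.0 - lam) / steps
    # Midpoint rule: evaluate the strip volume at the centre of each interval.
    return h * sum(v_closed(n, lam / (lam + (i + 0.5) * h))
                   for i in range(steps))

lam = 0.3
gaps = [abs(v_next_numeric(n, lam) - v_closed(n + 1, lam)) for n in (2, 3, 5)]

# The starting case n = 2 written out directly.
v2_direct = 1.0 - lam * (1.0 - math.log(lam))
```

The gaps are of the order of the integration error, which supports the passage from $V_n$ to $V_{n+1}$.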

We see accordingly that

$$V_n = 1 - \lambda_n \times \text{sum of first } n \text{ terms of the exponential series for } e^{-\log_e\lambda_n}.$$

But the first $n$ terms of the exponential series $e^x$

$$= e^x\{1 - I(n-1, x)\},$$

where $I$ denotes the incomplete Γ-function ratio. Thus

$$V_n = 1 - \lambda_n\cdot\lambda_n^{-1}\{1 - I(n-1, -\log_e\lambda_n)\} = I(n-1, -\log_e\lambda_n) \qquad (3).$$

Since $\lambda_n$ must always be less than unity the incomplete Γ-function ratio is real. If we use the Tables of the Incomplete Γ-Function*, we must look out $I(n-1, u)$, where

$$u = \frac{-\log_e\lambda_n}{\sqrt{n}} = \frac{-\log_{10}\lambda_n}{\sqrt{n}\,\log_{10}e} \qquad (4).$$

Now let us return to our original problem. We have a sample of $n$ variates $x_1, x_2, \ldots x_n$ taken from a population following a given or supposed law of distribution of which we know the probability integral. The values of this for our $n$ sample variates $x_1, x_2, \ldots x_n$ are respectively $p_1, p_2, \ldots p_n$. These probabilities follow a rectangular distribution. If we take $\lambda_n = p_1 p_2 \ldots p_n$ (the probability of the occurrence of the particular independent set of probabilities $p_1, p_2, \ldots p_n$), then the probability $P_{\lambda_n}$ of a combination occurring with a probability value as great as or greater than $\lambda_n$ is given by

$$P_{\lambda_n} = I\left(n-1,\ \frac{-\log_{10}\lambda_n}{\sqrt{n}\,\log_{10}e}\right),$$

or if $Q_{\lambda_n}$ be the probability of a lower probability occurring,

$$Q_{\lambda_n} = 1 - I\left(n-1,\ \frac{-\log_{10}\lambda_n}{\sqrt{n}\,\log_{10}e}\right) \qquad (5),$$

where $I(p, u)$ is the function tabled in the Tables of the Incomplete Γ-Function.
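Without the tables, $P_{\lambda_n}$ and $Q_{\lambda_n}$ may be had directly from the finite series of (2). The sketch below is a minimal illustration: the five probability integrals in `sample` are hypothetical, the function names are not from the paper, and `table_argument` merely reproduces the quantity $u$ of (4) that would be used to enter the tables.

```python
import math

def p_lambda(ps):
    """Return (P, Q) for probability integrals ps, via the series form of (2)."""
    n = len(ps)
    L = -sum(math.log(p) for p in ps)           # -log_e(lambda_n)
    q = math.exp(-L) * sum(L ** j / math.factorial(j) for j in range(n))
    return 1.0 - q, q

def table_argument(ps):
    """u of (4), computed from common logarithms as in the text."""
    n = len(ps)
    log10_lam = sum(math.log10(p) for p in ps)
    return -log10_lam / (math.sqrt(n) * math.log10(math.e))

sample = [0.31, 0.72, 0.05, 0.48, 0.66]         # hypothetical probability integrals
P, Q = p_lambda(sample)
u = table_argument(sample)

# Two simple special cases: one variate, and two variates each equal to 1/2.
P1, Q1 = p_lambda([0.2])
P2, Q2 = p_lambda([0.5, 0.5])
```

For a single variate the result reduces to $Q_{\lambda_1} = p_1$, the ordinary use of a probability integral.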

The effectiveness of this method of approaching the problem of small random samples lies in the facts, (i) that grouping the individuals of small samples to obtain a $\chi^2$ is a somewhat hazardous proceeding when $n$ is small, (ii) we do not make the somewhat rash assumption that binomials $(p+q)^s$ in which $p$ is not nearly equal to $q$, and $s$ is small, may be replaced by a normal curve.

* H.M. Stationery Office, 1922.

Given that the probability integral of the supposed (or real) parent population is known, then we find $Q_{\lambda_n}$ without any approximations or hypotheses.

If $Q_{\lambda_n}$ be very small, we have obtained an extremely rare sample, and we have then to settle in our minds whether it is more reasonable to suppose that we have drawn a very rare sample at one trial from the supposed parent population, or that our hypothesis as to the character of the parent population is erroneous, i.e. that the sample $x_1, x_2, \ldots x_n$ was not drawn from the supposed population.

This "settling of what is reasonable" occurs not only with the $(P, \chi^2)$ test*, but with the application of most criteria in statistics. It is not peculiar to the present, or, as I propose to call it, the $(P, \lambda_n)$ test.

* There exists a relation between the $(P, \chi^2)$ and $(P, \lambda_n)$ tests for which, I think, some explanation must be forthcoming. Namely, $P_{\chi^2}$ can be found from the $I(p, u)$ Tables by taking

$$P_{\chi^2} = 1 - I\left(\tfrac{1}{2}(n'-3),\ \frac{\chi^2}{\sqrt{2(n'-1)}}\right).$$

Hence if we take out the $P_{\chi^2}$ corresponding to $n' = 2n + 1$ and $\chi^2 = -2\log_{10}\lambda_n/\log_{10}e$, then $P_{\chi^2}$ will equal $Q_{\lambda_n}$; and thus within their range the $(P, \chi^2)$ Tables may be used to [...] and

A table of $P_{\lambda_n}$ for $n = 2$ to 30 by units and $-\log_{10}\lambda_n$ from 0 upwards by .125 or .250 can be formed from the Incomplete Γ-Function Tables, and when published will expedite the application of the $(P, \lambda_n)$ test. Such a table is completed, and will shortly be published.

Another very important attribute of this $(P, \lambda_n)$ test may now be mentioned. Let us suppose we have any number of parent populations each following its own law of distribution, then if $x_s$ be an individual drawn at random from the $s$th population, its probability integral will be also a random sample from a rectangular distribution. Hence the individuals $x_1, x_2, \ldots x_n$ need not be drawn from the same parent population, but may be drawn from any number of populations, one or more from each. Their probability integrals $p_1, p_2, \ldots p_n$ will all be random samples from a rectangular distribution and may be combined to serve as a random sample of $n$ from such a distribution. Our $(P, \lambda_n)$ test is accordingly a test of randomness, and not a test of whether the series of individuals have been drawn from a single particular type of distribution: $z$-distribution, $\sigma$-distribution, normal distribution or what not. If the sample of $n$ gives a highly improbable result, then we doubt its randomness. This want of randomness may arise because the selection of them or a certain number of them has not been random, or because their probability integrals or some of them have been calculated on the basis of a hypothesis as to their parent population or parent populations which is in itself incorrect, e.g. a sample of $x$'s might have their probability integrals calculated from a normal curve, whereas in fact they were really a random sample from a Type I curve. The probability integrals would not then appear as a reasonable random sample from a rectangular distribution. When we conclude that our sample of probability integrals is very improbably random, we must turn to other sources to determine whether it is owing to the selection being biased, or to an erroneous hypothesis as to the parental distributions on the basis of which we have computed our probability integrals*.

As a result of the above we see that this new $(P, \lambda_n)$ method enables us to form combined tests; some of the $x$'s may be means, others standard deviations, others correlation coefficients, etc. etc. All we need do, if these quantities are uncorrelated, is to calculate their probability integrals from the appropriate distributions using when necessary the most probable values, as determined from the samples, of the constants required in the parental distributions. Illustrations of such combined tests are given below.

If we suppose the probability integrals, $p_s$ ($s = 1, 2, \ldots n$), to be uniformly distributed, for example, by dividing the range 0 to 1 into $n$ equal sections, and placing a value of $p_s$ in the centre of each of them, so that $p_s$ takes the value $(2s-1)/2n$, then the corresponding $\chi^2$ would be zero, and $P_{\chi^2} = 1$. There is something difficult about this. The result agrees with the mathematical expectation, "one ball in each compartment," but the basis of the theory is really too narrow†. The probability of the result must depend upon where the $p$'s fall in each compartment. Let us consider what happens in the corresponding case of $P_{\lambda_n}$.

We have

$$\lambda_n = \frac{1\cdot 3\cdot 5 \ldots (2n-1)}{(2n)^n} = \frac{(2n)!}{2^{2n}\,n^n\,n!}.$$

* This dilemma as to randomness occurs as far as I can see in most published tests, e.g. we assume
as hypothesis that the two samples were drawn from the same normal population, and find it very
improbable; this may really be due to bias in the taking of one or other sample and not to the absence
of normality. Only further knowledge or investigation can lead to a discrimination between the two
possibilities. For example, we have the means and standard deviations of two samples, and we wish to
ascertain whether their parent populations are differentiated. We adopt the z-test. In doing so we
make two hypotheses, (i) that of normal distributions, (ii) that both samples are truly random. Hence
we have at least three possibilities, (a) that one or both parent populations have not normal distributions,
(b) that one or both samples are not random and (c) that the two samples do not come from the same
normal population. It seems to me that many users of the test, when they get an improbable result,
assume straight off that (c) must be the origin of it, and do not question the possibility of (a) or (b)
being the source of the observed improbability. The $P_{\lambda_n}$ test is no worse (or no better) than the other
tests in this respect, i.e. that we have to consider whether the observed want of randomness in the
distribution of the probability integrals is due to the hypothesis as to the nature of the distribution
curve or to want of the other randomness in the process of sampling. It is unfortunate that the same
word "randomness" has to be used in two places in the same investigation. The want of "randomness"
in the distribution of probability integrals may or may not be evidence of want of randomness in the
sampling.

† Another point is common to other tests as well as our present test, although at times the point is
overlooked, namely a result may, to speak paradoxically, be so highly probable as to be wholly improbable.
My memory carries me back many years to a memoir I read in manuscript. It concerned
the distribution of a character in over a thousand offspring of certain sets of parents; the character
being treated as involving five or six Mendelian factors. The classified offspring distributed themselves
absolutely accurately according "to the mathematical expectation"; but the odds of course against such
an occurrence were immense, and this was pointed out to the writer. The memoir afterwards appeared in
print, the "improbable high probability" having disappeared. This development not only convinced
me of the elasticity of Mendelian categories, but led me to realise how great is the risk that a biologist,
ignorant of mathematical statistics, may heedlessly run in his enthusiasm for a hypothesis. His results
may be "too good to be true."



Karl Pearson 


385 


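or

$$u = \frac{\sqrt{n}}{\log_{10} e}\left[\frac{1}{n}\log_{10}(n!) + \log_{10} n + 2\log_{10} 2 - \frac{1}{n}\log_{10}((2n)!)\right] \quad (7).$$

Applying Stirling's Theorem to the value of $\lambda_n$, we find for the asymptotic value of $u$,

$$u_{n \to \infty} = \sqrt{n} - \frac{.301,0300}{2\sqrt{n} \times .434,2945} \quad (8).$$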


Accordingly we have the following results, the first two being obtained by linear
interpolation from the Tables of the Incomplete Γ-Function and liable to an error of
a unit in the last decimal of $P_{\lambda_n}$.

    n         u            I(n-1, u)
    5       2.08479      I(4, 2.08479)     = .5018
   10       3.05400      I(9, 3.05400)     = .5008
  100       9.96561      I(99, 9.96561)    = .5004
  500      22.34523*     I(499, 22.34523)  = .5002
 1000      31.62763†     I(999, 31.62763)  = .5001
    ∞         ∞          I(∞, ∞)           = .5000
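The values of $u$ for the smaller $n$ can be checked numerically. The following Python sketch evaluates (7) through the log-gamma function and sets it beside the asymptotic form (8); the function names `u_exact` and `u_asymptotic` are illustrative only and do not come from the memoir.

```python
import math

def u_exact(n):
    """u of equation (7): -ln(lambda_n) / sqrt(n) when p_s = (2s - 1) / (2n)."""
    # ln(lambda_n) = ln((2n)!) - ln(n!) - 2n ln 2 - n ln n
    ln_lambda = (math.lgamma(2 * n + 1) - math.lgamma(n + 1)
                 - 2 * n * math.log(2) - n * math.log(n))
    return -ln_lambda / math.sqrt(n)

def u_asymptotic(n):
    """u of equation (8): sqrt(n) - ln 2 / (2 sqrt(n))."""
    return math.sqrt(n) - math.log(2) / (2 * math.sqrt(n))

for n in (5, 10, 500):
    print(n, round(u_exact(n), 5), round(u_asymptotic(n), 5))
```

The two forms differ by about $1/(24 n\sqrt{n})$, which is why the asymptotic value is already serviceable for $n$ of a few hundred.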


Thus we see that $P_{\lambda_n}$ rapidly approaches the value .5; in other words, in an
absolutely uniform distribution of the probability integrals $p_s$ there would be as
many distributions above as below this value. This is what we might a priori
expect, but it is suggestive in endeavouring to interpret the relation of $P_{\lambda_n}$ to $P_{\chi^2}$.
We will return to this point shortly.

In order to determine the values of the incomplete Γ-function for 99, 499 and
999, recourse was had to the E method of p. xvi of the Instructions as to the use of
the Tables‡. This method is peculiarly advantageous in the present case both
practically and theoretically.

If $p = n - 1$, and $v = \sqrt{n}\,u = \sqrt{p + 1}\,u = p + d'$,

£ . os + » (tf - <0 + 

. - 15 »r - df) + + ?» w >. _ ^ . ,. (9)i 


where 


and 


y» 


(p - 2 )** «-«’-*> 

■ ro»+i) * 

(p + l)Pe-« ,+l » 

r<p + i) 


p p e~ p 

yt= f\ P ~+iY 


y* 


(p+3)i’e-« ,+8 > 

! r u+tt ' 


$$d = .6666,6667 + \frac{.0197,5309}{p + 1} + \frac{.0072,1144}{(p + 1)^2} + \frac{.0003,8554}{(p + 1)^3} + \dots \quad (10)$$

gives the distance from mode to median.

* Differs by five units in the last figure from the asymptotic value.
† Asymptotic value.
‡ Tables of the Incomplete Γ-Function.





Working to four figures only the first two or three terms in (9) and the first
two in (10) suffice when $p$ is of the order 100 and more, and the values of the $y$'s
are easily determined. Now $v = p + d'$ and the asymptotic value of

$$v = \sqrt{n}\,u = \sqrt{n}\left(\sqrt{n} - \frac{.301,0300}{2\sqrt{n} \times .434,2945}\right).$$

Thus $d - d'$ is a very small quantity, i.e. approaches .01324, and since $y_2$ is of the
order .01 to .02 even the second term of (9) only influences the fourth decimal place,
and we see how the approximation to .5000 arises.

Now while the value of $P_{\chi^2}$ remains the same wherever we place the $p_s$ in the
$n$ divisions, the value of $P_{\lambda_n}$ will vary according to their position in those divisions,
and it seems very proper that it should. We will consider three cases of the $p$'s
for $n = 5$. Instead of taking the values $p_1 = .1$, $p_2 = .3$, $p_3 = .5$, $p_4 = .7$ and $p_5 = .9$ we
will:

(A) Give the $p$'s larger values close up to the boundaries of the divisions,
namely: $p_1 = .19$, $p_2 = .39$, $p_3 = .59$, $p_4 = .79$ and $p_5 = .99$.

(B) Give the $p$'s smaller values close up to the opposite boundaries of the
five divisions, namely: $p_1 = .01$, $p_2 = .21$, $p_3 = .41$, $p_4 = .61$ and $p_5 = .81$.

(C) Take values drawing up the $p$'s closely to the central value .50, namely:
$p_1 = .19$, $p_2 = .39$, $p_3 = .50$, $p_4 = .61$, $p_5 = .81$.

We have $\sqrt{n}\log_{10} e = .971,1120$, and accordingly

(A) $\log\lambda_n = -1.466,0675$, $u = 1.5096$, $Q_{\lambda_n} = .7486$,

(B) $\log\lambda_n = -3.371,1820$, $u = 3.47147$, $Q_{\lambda_n} = .1142$,

(C) $\log\lambda_n = -1.737,3970$, $u = 1.78908$, $Q_{\lambda_n} = .6287$.
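The three cases can be reworked without the tables. The sketch below is an illustration rather than the memoir's own procedure: it relies on the fact that for a whole number $n$ the incomplete Γ-function ratio $I(n - 1, u)$ reduces to a finite series in $x = \sqrt{n}\,u = -\log_e \lambda_n$. The name `q_lambda` is hypothetical.

```python
import math

def q_lambda(ps):
    """Q = 1 - I(n-1, u): chance that a product of n uniform variates
    falls below the observed product lambda_n of the probability integrals."""
    n = len(ps)
    x = -sum(math.log(p) for p in ps)      # x = -ln(lambda_n) = sqrt(n) * u
    # Closed form for whole n: Q = exp(-x) * sum_{k=0}^{n-1} x^k / k!
    term, total = 1.0, 1.0
    for k in range(1, n):
        term *= x / k
        total += term
    return math.exp(-x) * total

cases = {
    "A": [0.19, 0.39, 0.59, 0.79, 0.99],
    "B": [0.01, 0.21, 0.41, 0.61, 0.81],
    "C": [0.19, 0.39, 0.50, 0.61, 0.81],
}
for label, ps in cases.items():
    print(label, round(q_lambda(ps), 4))
```

The series agrees with the tabled values to within the error of linear interpolation in the tables.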

Now we notice that while $P_{\chi^2} = 1$ for all these cases*, $Q_{\lambda_n}$, while in none of the
three cases giving an improbable result, is far from giving identical measures for
the three distributions of the $p$'s. When we move the central points towards the
greater values of $p_s$, i.e. in (A), we find that 75% of cases have a less degree of
probability; when we move them towards the lesser degrees of probability, i.e. in
(B), we find that only about 11% have a lesser degree of probability, and finally
when we endeavour to concentrate towards the centre of the entire range, i.e. in (C),
we find the intermediate value, namely about 63% of cases have a lesser probability.

This illustration will I think suffice to indicate some advantages of the $P_{\lambda_n}$ test
over the $P_{\chi^2}$ test.

Illustration 1. Use of the Probability Integral from the Table of the Normal Curve.

The mean length of life of 15 samples of five electric lamps is provided by
E. S. Pearson.

* From the questionable replacing of a binomial by a normal curve in such a case

(*+*)*« 





TABLE I.

Length of Life of Lamps in Hours*.

 Sample             Standard      Sample             Standard
   No.     Mean     Deviation       No.     Mean     Deviation
    1      1295        440           9      1715        385
    2      2005        435          10      1650        460
    3      2445        580          11      1935        560
    4      1900        345          12      1760        280
    5      2570        290          13      2175        465
    6      1980        510          14      1570        505
    7      1990        445          15      1670        380
    8      1990        315


Is it reasonable to suppose that these 15 five-lamp means have been drawn
from the same normal population?

In order to answer this question we must determine what are the most probable
values to assign to the mean $M$ and the standard deviation $\Sigma$ of this supposed
common parent population. Let the samples be $v$ in number, and of different
sizes $n_t$, the mean and standard deviation of the $t$th sample being $m_t$ and $s_t$. Then
the most probable values of $M$ and $\Sigma$ will be obtained by making the following
expression a maximum:


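$$E = \prod_{t=1}^{v}\left(\frac{\sqrt{n_t}}{\sqrt{2\pi}\,\Sigma}\; e^{-\frac{n_t (m_t - M)^2}{2\Sigma^2}}\right) \times \prod_{t=1}^{v}\left(\frac{2}{\Gamma(\frac12(n_t - 1))}\left(\frac{n_t}{2\Sigma^2}\right)^{\frac12(n_t - 1)} s_t^{\,n_t - 2}\; e^{-\frac{n_t s_t^2}{2\Sigma^2}}\right) \quad (11).$$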


Taking the logarithmic differentials with regard to $M$ and $\Sigma$ we find

$$S\, n_t (m_t - M)/\Sigma^2 = 0,$$

or

$$M = \frac{S(n_t m_t)}{N}, \quad \text{where } N = S(n_t) \quad (12),$$

thus $M$ is the weighted mean of the sample means. Further, after differentiating
and collecting terms we find

$$\Sigma^2 = \frac{S\, n_t \left(s_t^2 + (m_t - M)^2\right)}{N} \quad (13).$$

Clearly (12) and (13) simply amount to saying that the best value to give $M$ and $\Sigma$
is that obtained by pooling all the individual values in the samples and finding the
mean and standard deviation of the combination. We may note here that if our
hypothesis were that our samples were drawn from independent parent populations,
but with the same variability, we should have to replace the $M$ in (13) by $M_t$ and
differentiate with respect to every $M_t$. This gives us $M_t = m_t$, and

$$\Sigma^2 = \frac{S\, n_t (s_t^2)}{N} \quad (14),$$

i.e. the weighted mean of the variances. The hypothesis that $M_t$ varies from
sample to sample may indicate, however, a secular change in $M_t$ and so involve
correlation between successive samples, which has not been allowed for in (11)
where the samples are considered independent.

* Tables of the Incomplete B-Function, Introduction, p. lii.
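As a check on (12) and (13), the pooled constants for the lamp data may be computed directly. In this short Python sketch the means and standard deviations are those of Table I, read so as to agree with the squares and deviations worked in Table II; the variable names are illustrative.

```python
import math

# Means and standard deviations of the 15 samples of five lamps (hours).
means = [1295, 2005, 2445, 1900, 2570, 1980, 1990, 1990,
         1715, 1650, 1935, 1760, 2175, 1570, 1670]
sds = [440, 435, 580, 345, 290, 510, 445, 315,
       385, 460, 560, 280, 465, 505, 380]
sizes = [5] * len(means)

N = sum(sizes)
# Equation (12): weighted mean of the sample means.
M = sum(n * m for n, m in zip(sizes, means)) / N
# Equation (13): pooled variance about the common mean M.
Sigma2 = sum(n * (s * s + (m - M) ** 2)
             for n, m, s in zip(sizes, means, sds)) / N
Sigma = math.sqrt(Sigma2)

print(M, round(Sigma2), round(Sigma, 2))
```

With equal sample sizes the weights cancel, so $M$ is the plain mean of the fifteen sample means.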

We proceed to find $M$ and $\Sigma$ for the data of the electric lamps. The needful
calculations are provided in Table II. If we have determined $M$ and $\Sigma$, then the
$m_t$'s are distributed normally with a mean $M$ and standard deviation $\Sigma/\sqrt{n}$, so that
we have only to look up $(M - m_t)/(\Sigma/\sqrt{n})$ in the table of the normal probability
integral to obtain the series of probability integrals, $p_t$, in column (h) of our table.
Great care must be taken to regard the sign of $(M - m_t)/(\Sigma/\sqrt{n})$ as the areas must
all be measured from one end of the normal curve.


TABLE II.

  (a)     (b)      (c)         (d)        (e)      (f)         (g)         (h)        (i)
 Sample   Mean   m_t - M   (m_t - M)²     s_t      s_t²    (c)/(Σ/√n)      p_t     log10 p_t
  No.
   1      1295    -615      378225        440    193600     -2.555                 4̄.716,0033
   2      2005    + 95        9025        435    189225     + .395      .65536     1̄.816,4799
   3      2445    +535      286225        580    336400     +2.222      .98686     1̄.994,2555
   4      1900    - 10         100        345    119025     - .042      .48362     1̄.684,4144
   5      2570    +660      435600        290     84100     +2.742      .96946     1̄.986,6299
   6      1980    + 70        4900        510    260100     + .291      .61447     1̄.788,5007
   7      1990    + 80        6400        445    198025     + .332      .63005     1̄.799,3750
   8      1990    + 80        6400        315     99225     + .332      .63005     1̄.799,3750
   9      1715    -195       38025        385    148225     - .810      .20897     1̄.320,0839
  10      1650    -260       67600        460    211600     -1.080      .14007     1̄.146,3451
  11      1935    + 25         625        560    313600     + .104      .54141     1̄.733,5263
  12      1760    -150       22500        280     78400     - .623      .26664     1̄.425,9253
  13      2175    +265       70225        465    216225     +1.101      .86455     1̄.936,7901
  14      1570    -340      115600        505    255025     -1.412      .12631     1̄.101,4377
  15      1670    -240       57600        380    144400     - .997      .15963     1̄.203,1145

 1/15 sum:  M = 1910        99937                189,812               log λ_n = -8.547,8434

$\Sigma^2 = 99,937 + 189,812 = 289,749$, $\sqrt{n}\log_{10} e = 1.682,0153$, $\Sigma = 538.28$, and $\Sigma/\sqrt{n} = 240.7261$.


Thus $u = 5.081,906$, and we require $I(14, 5.081,906)$; we find this from the
Tables of the Incomplete Γ-Function to be .88209. Hence $Q_{\lambda_n} = .118$: or, the number
of more improbable sets of 15 samples is 11.8%.
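The last step, from $\log_{10}\lambda_n$ to $Q_{\lambda_n}$, can be sketched in Python as follows. The helper names are hypothetical, and the finite series used for $I(n - 1, u)$ is a substitute for the table look-up, valid when $n$ is a whole number.

```python
import math

LOG10_E = math.log10(math.e)          # .434,2945

def u_from_log10(log10_lambda, n):
    """Argument u of the Gamma-function tables for a product of n integrals."""
    return -log10_lambda / (math.sqrt(n) * LOG10_E)

def incomplete_gamma_ratio(n, u):
    """I(n-1, u) for whole n, from the finite series of the gamma integral."""
    x = u * math.sqrt(n)              # x = -ln(lambda_n)
    term, total = 1.0, 1.0
    for k in range(1, n):
        term *= x / k
        total += term
    return 1.0 - math.exp(-x) * total

u = u_from_log10(-8.5478434, 15)      # log10(lambda_n) from Table II
I = incomplete_gamma_ratio(15, u)
print(round(u, 6), round(I, 4), round(1 - I, 3))
```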















We cannot on the result of this test assert that the lamps' lives were certainly not
samples from the same population.

Illustration 2. Use of the Incomplete B-Function Table to determine the Probability
Integrals. We may use the data of Illustration 1 to exemplify this method.
"Student" deduced* that if $M$ was the mean of the parent population and $m_t$ the
mean, $s_t$ the standard deviation of a sample of size $n_t$, then the distribution of
$z_t = (m_t - M)/s_t$ is given by the curve

$$y = y_0 (1 + z^2)^{-\frac12 n} \quad (15).$$

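We require the probability integral of this curve, and various tables have been
computed for it, either for $z_t$ or some modified form of $z_t$. I personally have found
nothing so comprehensive and convenient as the Incomplete B-Function Table.

The probability integral is

$$p(n, z) = \int_{-\infty}^{z} \frac{dz}{(1 + z^2)^{\frac12 n}} \bigg/ \int_{-\infty}^{\infty} \frac{dz}{(1 + z^2)^{\frac12 n}}.$$

Put $z^2 = \dfrac{x}{1 - x}$, or $x = \dfrac{z^2}{1 + z^2}$, then

$$p(n, z) = .5 + \tfrac12 \int_0^x x^{-\frac12}(1 - x)^{\frac12(n - 3)}\,dx \bigg/ \int_0^1 x^{-\frac12}(1 - x)^{\frac12(n - 3)}\,dx.$$

But the ratio of the two integrals $= I_x(\tfrac12, \tfrac12(n - 1))$, or the incomplete B-function
ratio, the quantity $I_x(p, q)$ tabled in the corresponding book of tables. Thus we
may write

$$p(n, z) = \tfrac12\left\{1 + I_x(\tfrac12, \tfrac12(n - 1))\right\} = \tfrac12\left\{2 - I_{1-x}(\tfrac12(n - 1), \tfrac12)\right\} = \tfrac12\left\{2 - I_{\frac{1}{1+z^2}}(\tfrac12(n - 1), \tfrac12)\right\} \quad (16).$$

The reason for this change of expression is that the Tables of the Incomplete
B-Function are provided for $p \geq q$. We must remember, however, as especially
important that we have got to allow for $z$ being negative. Accordingly we take

$$p(n, +z) = 1 - \tfrac12 I_{\frac{1}{1+z^2}}(\tfrac12(n - 1), \tfrac12), \qquad p(n, -z) = \tfrac12 I_{\frac{1}{1+z^2}}(\tfrac12(n - 1), \tfrac12) \quad (17).$$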
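Where the B-function tables are not to hand, the probability integral of the curve $(1 + z^2)^{-\frac12 n}$ may be approximated by direct quadrature. The Python sketch below swaps in a different technique from the table look-up of the text: with $z = \tan\theta$ the curve becomes $\cos^{n-2}\theta$, which is integrated by Simpson's rule. The function name and the step count are assumptions made for illustration.

```python
import math

def student_z_integral(n, z, steps=2000):
    """p(n, z): fraction of the area of y = (1 + z^2)^(-n/2) lying to the left of z.

    Substituting z = tan(theta) turns the integrand into cos(theta)^(n-2),
    which is integrated from -pi/2 by Simpson's rule (steps must be even).
    """
    def area(upper):
        lower = -math.pi / 2
        h = (upper - lower) / steps
        total = 0.0
        for i in range(steps + 1):
            theta = lower + i * h
            weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
            total += weight * max(math.cos(theta), 0.0) ** (n - 2)
        return total * h / 3

    return area(math.atan(z)) / area(math.pi / 2)

# The two branches of (17) are complementary: p(n, +z) + p(n, -z) = 1.
print(student_z_integral(5, 0.0))
print(student_z_integral(5, 0.5) + student_z_integral(5, -0.5))
```

Measuring every area from the same end of the curve, as here, is what the pair of formulae (17) secures for positive and negative $z$.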


For present purposes linear interpolation into the B-function table will suffice.
The following table indicates the needful work. $M$ remains = 1910.

TABLE III.

 Sample   m_t - M    s_t       z_t       1 + z_t²    1 - x_t          p_t      log10 p_t
  No.                                                = 1/(1 + z_t²)
   1       -615      440    -1.39772     2.95362      .33856         .0021     3̄.322,2193
   2       + 95      435    +0.21839     1.04769      .95448                   1̄.860,6850
   3       +535      580    +0.92241     1.85084      .54030         .9776     1̄.990,1612
   4       - 10      345    -0.02899     1.00084      .99916         .4915     1̄.691,5236
   5       +660      290    +2.27586     6.17954      .16182         .9997     1̄.999,8697
   6       + 70      510    +0.13725     1.01884      .98151         .6355     1̄.803,1156
   7       + 80      445    +0.17978     1.03232      .96869         .6753     1̄.829,4967
   8       + 80      315    +0.25397     1.06450      .93940         .7384     1̄.868,2917
   9       -195      385    -0.50649     1.25653      .79584         .1111     1̄.045,7141
  10       -260      460    -0.56522     1.31947      .75788         .0893     2̄.950,8515
  11       + 25      560    +0.04464     1.00199      .99801         .5201     1̄.716,0869
  12       -150      280    -0.53571     1.28698      .77701         .0997     2̄.998,6925
  13       +265      465    +0.56989     1.32477      .75486         .9123     1̄.960,1377
  14       -340      505    -0.67327     1.45329      .68809         .0590     2̄.770,8520
  15       -240      380    -0.63158     1.39889      .71485         .0693     2̄.840,7332

* Biometrika, Vol. vi. p. 8.


As before $\sqrt{n}\log_{10} e = 1.682,0153$, $S\log_{10} p_t = -9.361,6694 = \log_{10}\lambda_n$.

Thus $u = 5.565746$, and $Q_{\lambda_n} = 1 - I(14, 5.56575)$,
where the latter function is the Γ-function ratio. Interpolating linearly from the
Incomplete Γ-function Table, we have

$$Q_{\lambda_n} = 1 - .93041 = .0696.$$

Thus some 7% of series of samples would provide a greater degree of improbability.

Dr Egon S. Pearson has applied to the same data in the Introduction to the
Tables of the Incomplete B-Function a third test, namely a modification of Fisher's
test for determining whether the regression of one character on a second is a horizontal
straight line. This involves the computing of $\eta^2 = S\{n_t (m_t - M)^2\}/(N\Sigma^2)$ and the
determination of $I_{1-\eta^2}(\tfrac12(N - v), \tfrac12(v - 1))$ from the Tables of the Incomplete B-Function.
Thus as in our Illustration 1 we have to determine the most probable
values of $M$ and $\Sigma$, but there is only one value to be found from the table, not 15
from that table, and one from the Incomplete Γ-function Table. In this case
$\eta^2 = .3449$ and we have $Q_{\lambda_n} = I_{.6551}(30, 7) = .0152$, or there are 1.52% of cases
only more improbable. It will be seen that this result is more stringent than
either of our previous tests. Can we find a reason for this? Fisher's test is based
on a triple hypothesis: (i) linearity of regression, (ii) homoscedasticity of arrays
and (iii) their normality. Now the approach to a normal distribution of means is
fairly rapid even for distributions not absolutely normal. Further the formulae (12)
and (13) for $M$ and $\Sigma^2$, while we have deduced them from a normal distribution,
are most reasonable formulae to take in non-normal distributions, and lastly no
assumption is made as to homoscedasticity of arrays. Accordingly we might
anticipate that our test in Illustration 1 would cover a wider range than Fisher's
triple hypothesis does, and so it appears from the result.

Again in the z-test of the present Illustration we do suppose that each sample
is taken from a normal distribution, and that all these normal parent populations
have the same mean, but we do not insist on the standard deviations of all
these parent populations being the same. The probability integrals of the several
$z_t$ can be combined, as they are random selections of $p_t$ between 0 and 1. Thus
again the hypotheses involved do not seem so stringent as in Fisher's case.

With regard to the comparison of the tests in Illustrations 1 and 2, it is not at once
obvious why the selection from $v$ normal populations having the same means but not
necessarily the same standard deviations should be more stringent than the
selection from $v$ populations having the same means and standard deviations but
not necessarily strictly normal.

Illustration 3. In an experiment* in which a certain number of children were
given raw milk for four months and the same number of children of closely the
same age, stature and weight were given pasteurised milk for the same time, the
following system of mean growth differences, standard deviations of those differences
and the ratios ($z$'s) of those differences to their standard deviations were obtained.
The numbers are sufficiently large to admit of our computing the probability
integral of $z$ from the normal curve table:

TABLE IV.

Boys.

 Central Age   No. of   Raw Mean -     Standard Deviation             Probability Integral
 in years      Pairs    Pasteurised    of Difference          z_t     of z_t = p_t            log10 p_t
   6|            73       -.066            .054              -1.22                            1̄.948,7882
   7§                     +.022            .053              +0.41                            1̄.532,6308
   8}            71       -.003            .052              -0.06                            1̄.719,2869
   9f            77       +.011                              +0.20                            1̄.624,0141
  10f            60       +.002            .057              +0.04                            1̄.684,8872

 (The units are inches.)                                                           Total      2̄.509,5872

Thus: $-\log_{10}\lambda_5 = 1.490,4128$, $\sqrt{n}\log_{10} e = \sqrt{5} \times .434,2945 = .971,1120$ and accordingly
$u = 1.53475$.

The volume within the 5-hyperboloid $= I(4, 1.53475) = .262$, or the probability
of a set of values of $z$ with a less probability than those observed is .738. That is
to say that if the system of $z$'s were really drawn from normal populations about
74% of the cases would be less probable. We cannot accordingly assert that
there is any difference in growth in these boys according as to whether they took
raw or pasteurised milk.

* E. M. Elderton, "The Lanarkshire Milk Experiment," Annals of Eugenics, Vol. v. pp. 326-330.












Illustration 4. In the experiment referred to in the previous Illustration the
following results were obtained in the same manner for the Weight of Girls when
Raw Milk was administered and when no milk was given, the pairs being taken
of closely the same Age, Stature and Weight.

TABLE V.

Girls.

 Central Age   No. of   Raw Milk -    Standard Deviation             Probability Integral
 in years      Pairs    Control       of Difference          z_t     of z_t = p_t            log10 p_t
   6|           144      + 0.13           2.02                .06       .476,0778            1̄.677,6799
   7|           128      + 1.12           2.41                .46       .322,7581            1̄.508,8771
   8|           133      + 7.98           2.66               3.00       .001,3499            3̄.130,3016
   9|           133      + 5.62           2.77               2.03       .021,1783            2̄.325,8911
  10J                    +11.66           3.27               3.57       .000,1785            4̄.251,6382

 (The units are ounces.)                                                          Total     1̄0̄.894,3879

Thus: $-\log_{10}\lambda_5 = 9.105,6121$, $\sqrt{n}\log_{10} e = .971,1120$ and accordingly $u = 9.37648$,
and the volume inside the hyperboloid $= I(4, 9.37648) = .999,9922$, or the probability
of a set of $z$'s occurring with as great as, or a greater improbability than this
set is only .000,0078. We should accordingly argue that the raw milk feeders and
the controls can only be, with the highest degree of improbability, random samples
of the same population, i.e. the raw milk accelerated the growth of the girls
(especially the elder girls) in weight.
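The working of Illustrations 3 and 4 can be reproduced from the printed ratios $z_t$ alone. The Python sketch below is illustrative: it takes each probability integral as the normal area beyond $z_t$, which is the convention the tabled logarithms appear to follow, and combines the five integrals by the finite series for $I(n - 1, u)$. The names `upper_tail` and `q_lambda` are hypothetical.

```python
import math

def upper_tail(z):
    """Normal probability integral measured from the upper end: P(Z > z)."""
    return 0.5 * math.erfc(z / math.sqrt(2))

def q_lambda(ps):
    """Q = 1 - I(n-1, u) for the product of n probability integrals (whole n)."""
    n = len(ps)
    x = -sum(math.log(p) for p in ps)          # x = sqrt(n) * u
    term, total = 1.0, 1.0
    for k in range(1, n):
        term *= x / k
        total += term
    return math.exp(-x) * total

boys_z = [-1.22, 0.41, -0.06, 0.20, 0.04]      # raw minus pasteurised, height
girls_z = [0.06, 0.46, 3.00, 2.03, 3.57]       # raw minus control, weight

boys_p = [upper_tail(z) for z in boys_z]
girls_p = [upper_tail(z) for z in girls_z]
girls_log10 = sum(math.log10(p) for p in girls_p)

print(round(q_lambda(boys_p), 3))
print(round(girls_log10, 4), q_lambda(girls_p))
```

Whichever end the areas are measured from, the same end must be used for every $z_t$ in the set, since $\lambda_n$ is not unchanged when each $p_t$ is replaced by $1 - p_t$.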


Illustration 5. Use of Probability Integral of Correlation Coefficient. A number
of coefficients of correlation, $r_1, r_2, \dots r_t, \dots r_u$, are found from samples of sizes
$n_1, n_2, \dots n_t, \dots n_u$. The corresponding means and standard deviations for the
samples, the variates being $x$ and $y$, are $\bar{x}_1, \bar{y}_1, \sigma_{x_1}, \sigma_{y_1}$; $\bar{x}_2, \bar{y}_2, \sigma_{x_2}, \sigma_{y_2}$; ...;
$\bar{x}_t, \bar{y}_t, \sigma_{x_t}, \sigma_{y_t}$; ...; $\bar{x}_u, \bar{y}_u, \sigma_{x_u}, \sigma_{y_u}$. Each correlation may come from a parent
population with means $m_{x_t}, m_{y_t}$ and standard deviations $\Sigma_{x_t}, \Sigma_{y_t}$, but these parent populations
are supposed to have the same value of the correlation coefficient $\rho$. What is
the most probable value to give to $\rho$, and what is the chance that the populations
from which the $u$ samples are drawn really have the same correlation?

We suppose the distributions in the case of each of the $u$ parent populations
to be normal, and we will take $S\, n_t = N$. The distribution of the $t$th population,
if we suppose $x_{t\alpha}, y_{t\alpha}$ to be any member of it, will be


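$$z = \frac{1}{2\pi\,\Sigma_{x_t}\Sigma_{y_t}\sqrt{1 - \rho^2}} \exp\left\{-\frac{1}{2(1 - \rho^2)}\left[\frac{(x_{t\alpha} - m_{x_t})^2}{\Sigma_{x_t}^2} + \frac{(y_{t\alpha} - m_{y_t})^2}{\Sigma_{y_t}^2} - \frac{2\rho\,(x_{t\alpha} - m_{x_t})(y_{t\alpha} - m_{y_t})}{\Sigma_{x_t}\Sigma_{y_t}}\right]\right\} \quad (18).$$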


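If we take the product of such expressions for all values of $\alpha$ for the $t$th sample,
we find that the chance of such a sample arising from values of $x$ and $y$ lying
between $x_{t\alpha}$ and $x_{t\alpha} + \delta x_{t\alpha}$, $y_{t\alpha}$ and $y_{t\alpha} + \delta y_{t\alpha}$, and similar values for the $n_t - 1$ other
pairs of coordinates may be thrown into the familiar form

$$\left(\frac{1}{2\pi\,\Sigma_{x_t}\Sigma_{y_t}\sqrt{1 - \rho^2}}\right)^{n_t} \exp\left\{-\frac{n_t}{2(1 - \rho^2)}\left[\frac{\sigma_{x_t}^2 + (\bar{x}_t - m_{x_t})^2}{\Sigma_{x_t}^2} + \frac{\sigma_{y_t}^2 + (\bar{y}_t - m_{y_t})^2}{\Sigma_{y_t}^2} - \frac{2\rho\left(r_t\sigma_{x_t}\sigma_{y_t} + (\bar{x}_t - m_{x_t})(\bar{y}_t - m_{y_t})\right)}{\Sigma_{x_t}\Sigma_{y_t}}\right]\right\} \quad (19).$$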

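We have a similar value for each of the $u$ sets of samples. Now we suppose
the $u$ sets of samples to be independent and further nothing known about the
constants of the $u$ parent populations except that they have the same $\rho$ by hypothesis.
Accordingly we have to make the product of $u$ expressions like the above
a maximum by choice of the $4u + 1$ variates $m_{x_t}, m_{y_t}, \Sigma_{x_t}, \Sigma_{y_t}$ ($t = 1, 2, \dots u$) and $\rho$.
As they are independent, we can differentiate the single values like (18) to determine
the values for the first four types of variates, but we must differentiate the
combined product to obtain the value of $\rho$. We have at once $m_{x_t} = \bar{x}_t$, $m_{y_t} = \bar{y}_t$.

Inserting these we have to maximise the expression

$$\frac{1}{(1 - \rho^2)^{\frac12 n_t}\,\Sigma_{x_t}^{\,n_t}\Sigma_{y_t}^{\,n_t}} \exp\left\{-\frac{n_t}{2(1 - \rho^2)}\left(\frac{\sigma_{x_t}^2}{\Sigma_{x_t}^2} + \frac{\sigma_{y_t}^2}{\Sigma_{y_t}^2} - \frac{2\rho\, r_t\,\sigma_{x_t}\sigma_{y_t}}{\Sigma_{x_t}\Sigma_{y_t}}\right)\right\} \quad (20)$$

to find the proper values of $\Sigma_{x_t}$ and $\Sigma_{y_t}$. If we take the logarithm of this and
differentiate with regard to $\Sigma_{x_t}$ and $\Sigma_{y_t}$, we find, after dividing out a factor,

$$0 = -1 + \frac{1}{1 - \rho^2}\left(\frac{\sigma_{x_t}^2}{\Sigma_{x_t}^2} - \frac{\rho\, r_t\,\sigma_{x_t}\sigma_{y_t}}{\Sigma_{x_t}\Sigma_{y_t}}\right),$$

$$0 = -1 + \frac{1}{1 - \rho^2}\left(\frac{\sigma_{y_t}^2}{\Sigma_{y_t}^2} - \frac{\rho\, r_t\,\sigma_{x_t}\sigma_{y_t}}{\Sigma_{x_t}\Sigma_{y_t}}\right),$$

and accordingly by subtraction $\sigma_{x_t}^2/\Sigma_{x_t}^2 = \sigma_{y_t}^2/\Sigma_{y_t}^2$; or, since the standard
deviations must be positive,

$$\frac{\sigma_{x_t}^2}{\Sigma_{x_t}^2} = \frac{1 - \rho^2}{1 - \rho\, r_t} = \frac{\sigma_{y_t}^2}{\Sigma_{y_t}^2} \quad (21).$$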


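We have now to differentiate the product of $u$ expressions like (20) with regard
to $\rho$, where after differentiation we can make use of (21). The required expression
as far as $\rho$ is concerned is

$$\prod_{t=1}^{u} \frac{1}{(1 - \rho^2)^{\frac12 n_t}} \times \prod_{t=1}^{u} \exp\left\{-\frac{n_t}{2(1 - \rho^2)}\left(\frac{\sigma_{x_t}^2}{\Sigma_{x_t}^2} + \frac{\sigma_{y_t}^2}{\Sigma_{y_t}^2} - \frac{2\rho\, r_t\,\sigma_{x_t}\sigma_{y_t}}{\Sigma_{x_t}\Sigma_{y_t}}\right)\right\}.$$

Assuming that it is allowable to consider the double product independent of
$\Sigma$ and $\rho$ we have, by taking a logarithmic differential with regard to $\rho$,

$$0 = \frac{\rho}{1 - \rho^2}\, S\, n_t - \frac{\rho}{(1 - \rho^2)^2}\, S\, n_t\left(\frac{\sigma_{x_t}^2}{\Sigma_{x_t}^2} + \frac{\sigma_{y_t}^2}{\Sigma_{y_t}^2} - \frac{2\rho\, r_t\,\sigma_{x_t}\sigma_{y_t}}{\Sigma_{x_t}\Sigma_{y_t}}\right) + \frac{1}{1 - \rho^2}\, S\, n_t r_t \frac{\sigma_{x_t}\sigma_{y_t}}{\Sigma_{x_t}\Sigma_{y_t}}.$$

Now making use of (21) we find, since $S\, n_t = N$,

$$\frac{\rho}{1 - \rho^2} = \frac{1}{N}\, S\, \frac{n_t r_t}{1 - \rho\, r_t} \quad (22).$$

Our (21) agrees with the (B) equation of Mr Brander's (11)* and our (22) agrees
with his (11) (C), if we take the simple case of $u = 2$, when $N = n_1 + n_2$.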

Returning to (22), let us write

$$\mu_v = \frac{S\,(n_t r_t^{\,v})}{N} \quad (23),$$

or, $\mu_v$ is the weighted mean of the $v$th powers of the $r_t$'s. Accordingly we may write
our equation for the most probable value of $\rho$ in the form

$$\rho = \mu_1 + \rho(\mu_2 - \rho^2) + \rho^2(\mu_3 - \rho^3) + \rho^3(\mu_4 - \rho^4) + \rho^4(\mu_5 - \rho^5) + \dots \quad (24).$$

This equation allows us to make rapid approximations to the value of $\rho$, according
as to what power of the correlation coefficients may for a practical purpose be
considered negligible. Thus

$\rho_1 = \mu_1 =$ mean of the $r_t$'s,

$\rho_2 = \mu_1 + \rho_1(\mu_2 - \rho_1^2)$,

$\rho_3 = \mu_1 + \rho_2(\mu_2 - \rho_2^2) + \rho_1^2(\mu_3 - \rho_1^3)$,

$\rho_4 = \mu_1 + \rho_3(\mu_2 - \rho_3^2) + \rho_2^2(\mu_3 - \rho_2^3) + \rho_1^3(\mu_4 - \rho_1^4)$,

and so on (25).

The following example is taken from the paper by E. S. Pearson and S. S.
Wilks in the current issue†:

It is assumed that the frequency of Head Length and Head Breadth in skulls
follows a normal distribution. Samples of 20 skulls were taken from 30 different
races and their correlations calculated. The dimensions of the skulls and their
standard deviations for the 30 parent populations are supposed unknown, but
undoubtedly differ significantly. The problem is whether the correlations based
on these small samples can be considered random samples from parent populations
having a common correlation coefficient*. It is unnecessary to repeat the names
of the races here, as if the problem were to be considered in earnest, we should
not take all the $n_t$'s equal to the small number 20, but use all the skulls available
and weight the $r_t$'s, using formulae (22) and (23).

TABLE VI.

Racial Correlation Coefficients for equal small Samples of Thirty Races.

 Race    r_t      Race    r_t      Race    r_t      Race    r_t      Race    r_t
   1    +.097       7    -.037      13    +.319      19    +.018      25    +.245
   2    +.198       8    +.667      14    +.310      20    +.160      26    +.360
   3    +.576       9    +.014      15    +.019      21    +.178      27    +.592
   4    -.016      10    -.112      16    +.445      22    +.763      28    -.515
   5    +.173      11    +.219      17    +.410      23    +.101      29    +.023
   6    +.764      12    -.152      18    +.946      24    +.449      30    +.259

* Biometrika, Vol. xxv. p. 104.
† Biometrika, Vol. xxv. pp. 373-374.

The first four moments obtained by adding the powers given in Barlow's Tables
(Edition, Comrie) are as follows, due attention being paid to the sign of $r_t$:

$\mu_1 = .2489,6667$, $\mu_2 = .1568,5010$, $\mu_3 = .0904,4030$, $\mu_4 = .0677,0455$.

Hence $\rho_1 =$ mean $r_t = .2489,6667$.

Thus we have

$\rho_1^2 = .0619,8440$, $\rho_1^3 = .0154,3205$, $\rho_1^4 = .0038,4207$.

$\rho_2 = .2489,6667 + .2489,6667\,(.1568,5010 - .0619,8440) = .2725,8507$,

whence $\rho_2^2 = .0743,0262$, $\rho_2^3 = .0202,5378$,

$\rho_3 = .2489,6667 + .2725,8507\,(.1568,5010 - .0743,0262) + .0619,8440\,(.0904,4030 - .0154,3205) = .2761,1503$,

whence $\rho_3^2 = .0762,3951$,

$\rho_4 = .2489,6667 + .2761,1503\,(.1568,5010 - .0762,3951) + .0743,0262\,(.0904,4030 - .0202,5378) + .0154,3205\,(.0677,0455 - .0038,4207) = .2774,2504$.

We may therefore take as the most probable value of the correlation coefficient
in the series of $u$ parent populations to three decimal places, $\rho = .277$.
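The successive approximations of (25) are easily mechanised. The following Python sketch takes the four printed moments as given and rebuilds $\rho_1$ to $\rho_4$; the function name `rho_approximations` is an assumption of this illustration.

```python
def rho_approximations(mu, order):
    """Successive approximations rho_1, rho_2, ... following scheme (25).

    mu[j] holds the weighted mean of the (j+1)th powers of the r's, so that
    rho_k = mu_1 + sum over j = 1 .. k-1 of rho_{k-j}^j (mu_{j+1} - rho_{k-j}^(j+1)).
    """
    rhos = []
    for k in range(1, order + 1):
        value = mu[0]
        for j in range(1, k):
            base = rhos[k - j - 1]                 # rho_{k-j}
            value += base ** j * (mu[j] - base ** (j + 1))
        rhos.append(value)
    return rhos

mu = [0.24896667, 0.15685010, 0.09044030, 0.06770455]
rhos = rho_approximations(mu, 4)
for k, rho in enumerate(rhos, start=1):
    print(k, round(rho, 6))
```

Each further moment alters the result less than the one before it, so that three decimal places are settled by the fourth approximation.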

Starting with this value of $\rho$ we have now to find for the 30 values of $r$, given
above, their probability integrals. Miss David's Probability Integral Table† for
the Correlation Coefficient will enable such values to be determined. Meanwhile
she has provided the values for $\rho = .277$. The probability integral table for this
value of $\rho$ runs as follows:

TABLE VII.

Ordinates and Probability Integrals of samples of 20 from a Parent Population of
correlation $\rho$ = 0.277.

    r       Ordinates    Prob. Int. correct     δ²       δ⁴
                         to 5 figs.
  -1.00
  - .95
  - .90
  - .85
  - .80        .01                                        1
  - .75        .03                               1       -2
  - .70        .13          .00001                        4
  - .65        .45+         .00002               3
  - .60       1.27          .00006               6        6
  - .55       3.17          .00016              15        3
  - .50       7.10          .00041              27       12
  - .45      14.55-         .00093              51       11
  - .40      27.72          .001,96             86       17
  - .35      49.59          .003,85-           138       23
  - .30      83.92          .007,12            213       20
  - .25     135.13          .012,52            308       24
  - .20     207.94          .021,00            427       21
  - .15     306.81          .033,75-           567       12
  - .10     435.13          .052,17            719      - 3
  - .05     594.15-         .077,78            868      -19
    .00     781.89          .112,07            998      -43
    .05     992.07          .156,34           1085      -70
    .10    1213.39          .211,46           1102      -96
    .15    1429.44          .277,60           1023     -105
    .20    1619.53          .353,97            839     -117
    .25    1760.79          .438,73            538      -98
    .30    1831.32          .528,87            139      -60
    .35    1814.42          .620,40           -320      - 5
    .40    1703.00          .708,73           -784       67
    .45    1503.12          .789,22          -1181      138
    .50    1235.43          .857,90          -1440      184
    .55     933.13          .912,18          -1515      195
    .60     635.96          .951,31          -1395      162
    .65     381.09          .976,49          -1113       76
    .70     193.23          .990,54           -755      -22
    .75      78.10          .997,04           -419      -91
    .80      22.75          .999,35+          -174     -120
    .85       3.95          .999,92           - 49      -84
    .90        .33         1.000,00           -  8      -33
    .95        .00         1.000,00                     - 8
   1.0         .00         1.000,00

δ⁶ is negligible.

Probability Integrals and their logarithms for observed values of $r$.

    r_t      Prob. Int. correct    Logarithm
             to 4 figs.            of P. Int.
   +.097        .2085-             1̄.319,1061
   +.198        .3507              1̄.544,9368
   +.576        .9418              1̄.973,9587
   -.016        .1008              1̄.003,4605
   +.173        .3115+             1̄.493,4581
   +.764        .9980              1̄.999,1305
   -.037        .0858              2̄.933,4873
   +.667        .9823              1̄.992,2441
   +.014        .1234              1̄.091,3152
   -.112        .0472              2̄.673,9420
   +.219        .3853              1̄.585,7990
   -.152        .0331              2̄.519,8280
   +.319        .5637              1̄.751,0480
   +.310        .5473              1̄.738,2254
   +.019        .1277              1̄.106,1909
   +.445        .7816              1̄.892,9846
   +.410        .7256              1̄.860,6973
   +.946       1.0000-              .000,0000
   +.018        .1268              1̄.103,1193
   +.160        .2921              1̄.465,5316
   +.178        .3100              1̄.491,3617
   +.763        .9979              1̄.999,0870
   +.101        .2127              1̄.327,7675
   +.449        .7877              1̄.896,3608
   +.245        .4300              1̄.633,4685
   +.360        .6385-             1̄.805,1609
   +.592        .9460              1̄.975,8911
   -.515        .0003              4̄.477,1213
   +.023        .1311              1̄.117,6027
   +.259        .4458              1̄.649,1401

$\log\lambda_n = -17.578,6760$,
$\sqrt{30} \times .434,2945 = 2.378,7289$,
or $u = 7.389,902$.

Accordingly

$$Q_{\lambda_n} = 1 - I(29, 7.389,902) = 1 - .963 = .037.$$

Thus, if the samples were all derived from populations having the same correlation
we should only in less than 4% of cases get a more improbable result than
that observed. It is thus unlikely that the thirty races have the same correlation
coefficient between Head Length and Head Breadth.

Messrs E. S. Pearson and S. S. Wilks, by applying the approximate method of
Fisher, obtain a $\chi^2 = 96.01$ and a probability $P_{\chi^2} < .000,030$ that the correlation
coefficients between head length and head breadth are not the same for all the
thirty races. I am not prepared to state whether the extreme difference in this
case is due to the test applied being really more stringent, or to the fact that
Fisher's approximate z-test for $r$ can give exaggeratedly improbable values in the
case of outlying values of $r$ such as those of + .946 and - .592 attributed to the
Guanche and the Turk.

* A priori the hypothesis of common correlation is extremely unlikely, for if these races were
produced by selection from a common stock, that selection would modify the correlations.
† Manuscript Table shortly to be published.

Illustration 6. Use of the Incomplete Γ-Function Table to find the Probability
Integrals. We proceed first to indicate the Γ-function expression for the Probability
Integral $p_n$ of the Standard Deviation $s$ of a random sample of $n$ drawn from a
normal curve of mean $M$ and standard deviation $\Sigma$.

The equation for its distribution is

$$y = y_0\, s^{\,n-2}\, e^{-ns^2/(2\Sigma^2)}\, ds \qquad (26).$$

Take $x = ns^2/(2\Sigma^2)$, and we have

$$y = y_0'\, x^{\frac{n-3}{2}}\, e^{-x}\, dx \qquad (27).$$

Therefore the probability integral in the form ready for entering the incomplete
Γ-function table is

$$p_n = I\left(\tfrac{1}{2}(n-3),\ \frac{ns^2}{\sqrt{2(n-1)}\,\Sigma^2}\right).$$

If we suppose the means and standard deviations are due to random sampling
from a common normal population, then we have, by (12) and (13),

$$M = \frac{S(n_t m_t)}{N}$$

and

$$\Sigma^2 = \frac{S\, n_t \left(s_t^2 + (m_t - M)^2\right)}{N}.$$


If we apply these results to the data in Table I we have $\Sigma^2 = 289{,}749$ and $n_t = 5$
for all values of $t$. Hence

$$p_{n_t} = I\left(1,\ \frac{s_t^2}{163{,}907}\right).$$

Taking the values from column (f) of Table II we find


TABLE VIII.


$S(\log p_{n_t}) = \log \lambda_n = -5.756{,}3691$, $\sqrt{n}\,\log_{10} e = 1.682{,}0153$, $u = 3.42233$.


Thus $I(14,\ 3.42233) = .3512$, or some 65% of series of 15 samples of 5 lamps
would have a more improbable set of standard deviations.
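The entry into the incomplete Γ-function table described in this illustration can be imitated numerically. The following is only a sketch under stated assumptions: the function names are hypothetical, the table ratio $I(p, u)$ is taken to be the regularised lower incomplete gamma function of index $p + 1$ at argument $u\sqrt{p+1}$, and that function is summed from its power series instead of being read from the Tables.

```python
import math

def lower_gamma_ratio(shape, x, terms=400):
    """Regularised lower incomplete gamma P(shape, x), summed from its power series."""
    if x <= 0.0:
        return 0.0
    log_x = math.log(x)
    total = 0.0
    for k in range(terms):
        # k-th term: x^(shape+k) * e^(-x) / Gamma(shape + k + 1)
        total += math.exp((shape + k) * log_x - x - math.lgamma(shape + k + 1.0))
    return total

def sd_probability_integral(s_squared, n, sigma_squared):
    """p_n = I((n-3)/2, u): chance of a sample variance below s_squared."""
    shape = 0.5 * (n - 1)                      # p + 1 in the notation I(p, u)
    x = n * s_squared / (2.0 * sigma_squared)  # the variate x of curve (27)
    return lower_gamma_ratio(shape, x)

n, sigma_squared = 5, 289749.0                 # values quoted in the text
# Divisor that turns s_t^2 into u for samples of five (the 163,907 of the text).
divisor = math.sqrt(2.0 * (n - 1)) * sigma_squared / n
```

A call such as `sd_probability_integral(s_t_squared, 5, 289749.0)` then plays the part of $I(1, s_t^2/163{,}907)$ for one sample; the sample variances themselves come from Table I and are not reproduced here.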












General Criterion for Random Sampling 

Illustration 7. In Illustration 1 we have obtained the probability integrals of 
15 tests of means and in the last illustration of 15 tests of standard deviations. 
As we have seen, probability integrals may be combined, and this is possible here 
because there is no correlation in samples from a normal population between mean 
and standard deviation. 

The combined $\log \lambda_n = -8.547{,}8434 - 5.756{,}3691 = -14.304{,}2125$,

and $\sqrt{n}\,\log_{10} e = \sqrt{30} \times .434{,}2945 = 2.378{,}7289$.

Accordingly $u = 6.01338$, and

$$I(n - 1,\ u) = I(29,\ 6.01338) = .7189,$$

and $1 - I(29,\ 6.01338) = .281$, or some 28% of sets of 15 samples drawn from a normal
surface would have more improbable sets of means and standard deviations
than occur in this set of 15 samples of five lamps. Thus from whatever standpoint
we regard the problem, we have not succeeded in condemning the hypothesis that
the 15 samples of five lamps each may have been drawn from the same normal
population.
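The arithmetic of the combined test can be retraced in a few lines. This is a sketch, not the tabular procedure of the paper: the function name `p_lambda` is hypothetical, and it assumes that $1 - I(n-1, u)$ with $u = -\log_e \lambda_n/\sqrt{n}$ equals the upper tail of a gamma curve of integer index $n$, which for integer $n$ reduces to a finite Poisson-type sum.

```python
import math

def p_lambda(sum_log10_p, n):
    """Return (u, P) for n probability integrals whose log10 values sum to sum_log10_p."""
    x = -sum_log10_p * math.log(10.0)   # x = -ln(lambda_n)
    u = x / math.sqrt(n)                # argument u of I(n - 1, u)
    term, total = 1.0, 1.0
    for k in range(1, n):               # builds sum_{k < n} x^k / k!
        term *= x / k
        total += term
    return u, math.exp(-x) * total      # P = 1 - I(n - 1, u)

# Fifteen tests of means and fifteen of standard deviations, as combined above.
u, p = p_lambda(-8.5478434 - 5.7563691, 30)
print(round(u, 5), round(p, 3))
```

With the two sums of logarithms quoted above, the sketch returns a value of $u$ agreeing with 6.01338 and a probability close to the .281 of the text.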

But there is a considerable difference between the combined z-test and the
combined $\lambda_n$-test (i.e. $Q_z = .070$ and $Q_{\lambda_{30}} = .281$). We have already pointed out that
the z-test only involves the populations from which the individual z's are
obtained having the same mean and being normal; these populations may have
different standard deviations, and if the selections from each of these distinct
populations be random, their probability integrals for z will all follow a
rectangular distribution and may be combined. On the other hand our combined test
assumes normal parent populations with the same mean and the same standard
deviation. We should thus expect it to be more stringent than the z-test;
actually it is less. But this may be accounted for by another factor which arises
here: the z-test associates every difference of mean with a definite standard deviation.

In the present illustration the 15 standard deviations, $s_t$, of the samples might
have been associated with any one of the 15 differences of mean, $m_t - M$. In this
respect the combined test of this illustration seems to be less stringent than the
z-test of Illustration 2.

Accordingly we ought to be very careful in considering a test to state the 
hypothesis we are testing on as wide a basis as it admits in regard to the methods 
employed. 

Thus in Illustration 2 we are not really testing whether the 15 means come 
from a single normal population with the same mean and standard deviation. We 
are testing whether the 15 means come from 15 normal populations with the same 
mean and possibly different standard deviations. If the result were such that we 
rejected the latter hypothesis, the former must be rejected for it is involved in the 
latter. But if the latter be reasonable, it does not follow that the former also
may be. 





In Illustrations 1 and 6 we have made the hypothesis that all the 15 samples
were drawn from the same normal population, and calculated its mean and standard
deviation as the most probable values on the basis of the data provided by the 15
samples. But we have not specified that the 15 standard deviations are to be
associated with special values of the differences of the means. Had the result
come out highly improbable we should have rejected the wider hypothesis, and
accordingly the narrower, which is included in it.

One point may be noted. The probability of a worse result for the means
= .6488 and for a worse result for the standard deviations = .1179. The combined
improbability of a worse result for both, as these are independent, = .0765, which
is of the same order of probability as the .0696 provided by the z-test in
Illustration 2. The correspondence is possibly of no significance.

If we take the 30 probability integrals provided by Illustrations 1 and 6, and
distribute them in five subranges of .2, we find

              0.0-0.2   0.2-0.4   0.4-0.6   0.6-0.8   0.8-1.0
  Expected       6         6         6         6         6
  Observed       6         6         7         8         3

The $\chi^2 = 2.3333$, and the corresponding $P_{\chi^2}$ for five groups = .676.

This may be compared with the .281, as indicating the weakness of the
assumptions on which the $(\chi^2, P)$ method depends for a small number of groups and
a small total frequency.
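The grouping test just quoted is short enough to check directly. The sketch below assumes that "five groups" means four degrees of freedom, for which the tail of the $\chi^2$ curve has the closed form $e^{-\chi^2/2}(1 + \chi^2/2)$; the function names are illustrative only.

```python
import math

def chi_square_goodness(observed, expected):
    """Sum of (observed - expected)^2 / expected over the groups."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

def chi_square_tail_4df(chi2):
    """P(chi-square > chi2) for four degrees of freedom (closed form)."""
    half = 0.5 * chi2
    return math.exp(-half) * (1.0 + half)

observed = [6, 6, 7, 8, 3]      # the 30 probability integrals in subranges of .2
expected = [6.0] * 5            # a rectangular distribution gives 6 per subrange
chi2 = chi_square_goodness(observed, expected)
p = chi_square_tail_4df(chi2)
print(round(chi2, 4), round(p, 3))
```

The $\chi^2$ comes to 14/6 = 2.3333 as stated, and the closed form gives a probability of about .675, in close agreement with the figure printed above.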

Illustration 8. Application to Linearity of Regressions. In some recent work
by Professor H. Ruger the correlation tables for Weight and Vital Capacity in
28 age groups were reached and the values of $r$, $\eta_{x.y}$, $\eta_{y.x}$ calculated for each
of these groups. From these values the functions $\zeta_{x.y} = (\eta_{x.y}^2 - r^2)/(1 - r^2)$ and
$\zeta_{y.x} = (\eta_{y.x}^2 - r^2)/(1 - r^2)$ were tabled. The distribution of $\zeta$ on the assumptions
made by Fisher of linearity of regression, with homoscedastic normal arrays, leads
to the curve

$$y = y_0\, \zeta^{\frac{1}{2}(a-4)}\,(1 - \zeta)^{\frac{1}{2}(n-a-2)} \qquad (28),$$

where $n$ is the size of the sample and $a$ the number of arrays; and accordingly
the probability integral of $\zeta$ could be found from the Tables of the Incomplete
B-Function as long as $n - a - 2 \le 100$.

Unfortunately in the data we have referred to this condition is only satisfied
in one case; in the others $n - a - 2$ is in excess and often considerably in excess
of 100. We therefore replace in (28) the second factor by an exponential term,
and we have, if $\zeta' = \tfrac{1}{2}(n - a - 2)\,\zeta$, the distribution

$$y = y_0'\, \zeta'^{\,\frac{1}{2}(a-4)}\, e^{-\zeta'} \qquad (29).$$






The probability integral will accordingly be $I(\tfrac{1}{2}(a - 4),\ u)$, where

$$u = \frac{\zeta'}{\sqrt{\tfrac{1}{2}(a-2)}},$$

and $I(p, u)$ is the incomplete Γ-function ratio to be sought for in the Tables of
the Incomplete Γ-Function. This is feasible up to 104 arrays, a number unlikely
to be required.

The reader must bear in mind that $a$ is the number of arrays on which $\eta$ is
computed, and we must distinguish $a_{x.y}$ corresponding to $\eta_{x.y}$ from $a_{y.x}$
corresponding to $\eta_{y.x}$.


The problem before us is the following: we have 28 tables and 28 values of $r$
and say $\eta_{y.x}$, i.e. Weight on Vital Capacity. May the 28 regression lines of Weight
on Capacity be considered as a system, which does not differ from straight lines

TABLE IX.

Regression of Weight on Vital Capacity.

 No. of   Age     Size          ½(a-4)    n_t-a-2                √(2(a-2))
 Sample   Group   n_t   a_y.x   (= p)                ζ_y.x                     u         p_t        log10 p_t
   t
   1      6-12    105    13     p = 5.5   q = 46     .247,5080                           .004       3̄.602,0600
   2      13-15   331    15     5.5       314        .067,7070   5.099,0195   4.16943    .068       2̄.832,5089
   3      16      241    15     5.5       224        .068,6630   5.099,0195   3.01593    .285       1̄.454,8449
   4      17      288    14     5         272        .031,4678   4.898,9795   1.74715    .022       2̄.342,4227
   5      18      320    15     5.5       303        .040,2568   5.099,0195   2.39219    .512       1̄.709,2700
   6      19      310    18     7         290        .055,5070   5.656,8542   2.84568    .446       1̄.649,3349
   7      20      389    17     6.5       370        .013,1104   5.477,2256   0.88564    .993       1̄.996,9492
   8      21      321    16     6         303        .057,4679   5.291,5026   3.29070    .235       1̄.371,0679
   9      22      289    16     6         271        .009,2677   5.291,5026   0.47464    .999(64)   1̄.999,8436
  10      23      316    17     6.5       297        .063,0081   5.477,2256   3.41658    .227       1̄.356,0259
  11      24      276    14     5         260        .067,7665   4.898,9795   3.59599    .128       1̄.107,2100
  12      25      223    17     6.5       204        .063,3770   5.477,2256   2.36048    .608       1̄.783,9036
  13      26      244    16     6         226        .049,5475   5.291,5026   2.11617    .670       1̄.826,0748
  14      27      195    15     5.5       178        .075,0919   5.099,0195   2.62136    .430       1̄.633,4685
  15      28      188    16     6         170        .086,0810   5.291,5026   2.76267    .409       1̄.611,7233
  16      29      186    15     5.5       169        .096,6815   5.099,0195   3.20438    .231       1̄.363,6120
  17      30      198    14     5         182        .068,2614   4.898,9795   2.53595    .4125      1̄.615,4240
  18      31-32   305    17     6.5       286        .081,0745   5.477,2256   4.23366    .080       2̄.903,0900
  19      33-34   280    16     6         262        .036,9607   5.291,5026   1.82956    .785       1̄.894,8697
  20      35-36   290    16     6         272        .056,4152   5.291,5026   2.89902    .355       1̄.550,2284
  21      37-38   266    16     6         248        .036,2694   5.291,5026   1.69986    .831       1̄.919,6010
  22      39-41   319    16     6         301        .040,0175   5.291,5026   2.20878    .606       1̄.782,4726
  23      42-44   267    14     5         251        .022,8722   4.898,9795   1.17186    .282       1̄.450,2491
  24      45-47   196    16     6         178        .055,8809   5.291,5026   1.87977    .766       1̄.884,2288
  25      48-51   222    16     6         204        .023,4922   5.291,5026   0.90568    .884       1̄.946,4523
  26      52-55   147    15     5.5       130        .097,3283   5.099,0195   2.48139    .475       1̄.676,6936
  27      56-61   186    14     5         170        .055,0076   4.898,9795   1.90882    .673       1̄.828,0151
  28      62-81   163    17     6.5       144        .150,1917   5.477,2256   3.94864    .118       1̄.071,8820

p_t gives the probability of a higher ζ_y.x occurring than that observed.
S(log10 p_t) = log10 λ_n = -14.836,4732, √n log10 e = 2.298,0706. Hence u = 6.45606.







more than would reasonably be the result of random sampling? In other words
are the 28 $p_t$'s as based on the 28 $\zeta$'s a random sample from the distribution
curve (29)?

For the $t$th sample, we require $n_t$ the size of the sample, $a_{y.x}$, the number of
arrays of Weight on Vital Capacity, the $\zeta_{y.x}$, $\sqrt{2(a_{y.x} - 2)}$, $n_t - a_{y.x} - 2$ and the
$p = \tfrac{1}{2}(a_{y.x} - 4)$ of the incomplete Γ-function, and lastly the value of

$$u = \frac{(n_t - a_{y.x} - 2)\,\zeta_{y.x}}{\sqrt{2(a_{y.x} - 2)}}.$$

Hence from $p$ and $u$ we find the probability integral $p_t$ of the sample and record
its logarithm. Their sum gives the required $\log_{10} \lambda_n$.

Thus we find $I(27,\ 6.45606) = .8748$ and sets of 28 tables between Weight and
Vital Capacity for age groups like the above, if obtained as random samples from
parent populations with a linear regression of Weight on Vital Capacity, would give
a less probable set in 12.5% of cases. Our results therefore taken as a whole do not
provide strong evidence of non-linear regression in the case of Weight on Vital Capacity.
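The steps for a single age group can be sketched as follows. The names are hypothetical, and the incomplete Γ-function ratio is assumed to be the regularised lower incomplete gamma function of index $\tfrac{1}{2}(a-2)$, summed here from its power series instead of being interpolated from the Tables; $p_t$ is taken as its complement, the chance of a higher $\zeta$.

```python
import math

def upper_gamma_ratio(shape, x, terms=400):
    """1 - P(shape, x), with P summed from the power series of the incomplete gamma."""
    if x <= 0.0:
        return 1.0
    log_x = math.log(x)
    lower = 0.0
    for k in range(terms):
        # k-th term: x^(shape+k) * e^(-x) / Gamma(shape + k + 1)
        lower += math.exp((shape + k) * log_x - x - math.lgamma(shape + k + 1.0))
    return 1.0 - lower

def linearity_probability(n, a, zeta):
    """Return (u, p_t) for a sample of size n with a arrays and observed zeta."""
    shape = 0.5 * (a - 2)             # p + 1, with p = (a - 4) / 2
    x = 0.5 * (n - a - 2) * zeta      # the scaled variate of curve (29)
    u = x / math.sqrt(shape)          # same as (n - a - 2) zeta / sqrt(2 (a - 2))
    return u, upper_gamma_ratio(shape, x)

# Second age group of Table IX: n = 331, a = 15, zeta = .067,7070.
u, p_t = linearity_probability(331, 15, 0.067707)
print(round(u, 5), round(p_t, 3))
```

For this group the sketch gives $u$ in agreement with the tabled 4.16943 and a $p_t$ close to the tabled .068.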

Illustration 9. Random Samples of Correlation Coefficient from a Normal
Population of zero Coefficient. The following system of racial correlation coefficients
for Cephalic Index (100 B/L) and Upper Face Index is given by Tippett*. Applying
an approximate method he concludes as follows:

“Thus the combined experience of Table XLIII [this corresponds to our Table X 
below] lends no support to the view that the two characters are associated even 
after making allowance for the possibility of racial differences,” p. 143. 

It seems worth while investigating whether the methods of the present paper 
confirm Tippett’s conclusions. 


TABLE X.

Correlation Coefficients of Cephalic Index with Upper Face Index for thirteen Races†.

 Index                              Size of       Correlation
 No.     Race                       Sample n_t    Coefficient r_t
  1      Australians                   66           +0.089
  2      Negroes                       77           +0.182
  3      Duke of York Islanders        53           -0.093
  4      Malays                        60           -0.185
  5      Fijians                       32           +0.217
  6      Papuans                       39           -0.255
  7      Polynesians                   44           +0.002
  8      Alfourons                     19           -0.302
  9      Micronesians                  32           -0.251
 10      Copts                         34           -0.147
 11      Etruscans                     47           -0.021
 12      Europeans                     80           -0.198
 13      Ancient Thebans              152           -0.067

* The Methods of Statistics, p. 142, 1931.

† Several of the groups are scarcely anthropological unities, but the series will serve as an example
of method.





We will first find the most probable value of $\rho$, the correlation of the parent
populations, if they all had the same coefficient. Applying the method of p. 394
above we find, if $N = S(n_t)$,

$$\frac{S(n_t r_t)}{N} = -.060{,}9878, \qquad \frac{S(n_t r_t^2)}{N} = .024{,}4218,$$

$$\frac{S(n_t r_t^3)}{N} = -.002{,}7694, \qquad \frac{S(n_t r_t^4)}{N} = .001{,}1779.$$

Hence we deduce

$\rho_1 = -.060{,}9878$, $\rho_2 = -.062{,}2504$, $\rho_3 = -.062{,}2763$, and $\rho_4 = -.062{,}2772$.

Accordingly we find as the most probable value for $\rho$

$$\rho = -.06228.$$

It cannot therefore be asserted that the most probable value of $\rho$ as indicated by
the data is zero. It looks as if there existed a small negative correlation between
the First Cephalic and the Upper Face Indices.

Let us first try what the series leads to when we replace this most probable
value of $\rho$ by $\rho = 0$. In this case the frequency distribution of $r$ as a selection from
a population having zero correlation coefficient, $\rho = 0$, is given by

$$y = y_0\,(1 - r^2)^{\frac{n-4}{2}} \qquad (30),$$

and accordingly the probability integral is given by

$$p_{n,r} = \int_{-1}^{r} (1 - r^2)^{\frac{1}{2}(n-4)}\, dr \Big/ \int_{-1}^{+1} (1 - r^2)^{\frac{1}{2}(n-4)}\, dr \qquad (31).$$

This can be reduced to the incomplete B-function ratio by one or other of two
transformations.

(i) Take $r^2 = x$, and we find

$p_{n,r} = 1 - \tfrac{1}{2} I_{1-x}\left(\tfrac{1}{2}(n - 2),\ \tfrac{1}{2}\right)$, if $r$ be positive,

$p_{n,r} = \tfrac{1}{2} I_{1-x}\left(\tfrac{1}{2}(n - 2),\ \tfrac{1}{2}\right)$, if $r$ be negative.

(ii) Put $\tfrac{1}{2}(1 + r) = w$, and we reach

$p_{n,r} = I_{\frac{1}{2}(1+r)}\left(\tfrac{1}{2}(n - 2),\ \tfrac{1}{2}(n - 2)\right)$, for both signs of $r$.

We shall make use of the second transformation as more readily lending itself
to interpolation* into the Incomplete B-Function Tables.
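The probability integral (31) can also be evaluated by direct quadrature, which gives a check on values read from the B-function tables. The sketch below is an illustration under stated assumptions and not the tabular interpolation used in the paper: the function name is hypothetical, Simpson's rule is swapped in for the tables, and $n$ is assumed to exceed 4 so that the integrand vanishes at $r = \pm 1$.

```python
def r_probability_integral(r, n, steps=2000):
    """Area of (1 - t^2)^((n-4)/2) from -1 to r, divided by the area from -1 to +1."""
    power = 0.5 * (n - 4)

    def curve(t):
        return (1.0 - t * t) ** power

    def simpson(lo, hi):
        # Composite Simpson's rule; steps must be even.
        h = (hi - lo) / steps
        total = curve(lo) + curve(hi)
        for i in range(1, steps):
            total += (4.0 if i % 2 else 2.0) * curve(lo + i * h)
        return total * h / 3.0

    return simpson(-1.0, r) / simpson(-1.0, 1.0)

# Two entries of Tippett's series: (n, r) = (66, +.089) and (80, -.198).
print(round(r_probability_integral(0.089, 66), 3))
print(round(r_probability_integral(-0.198, 80), 3))
```

These two calls should fall close to the values .761 and .039 recorded for the same samples in Table XI.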

* I have considered it adequate for present purposes to interpolate linearly into the Tables of the
Incomplete B-Function, but even with this simplification the determination of $I_{x+\theta d}(p + \tfrac{1}{2},\ p + \tfrac{1}{2})$ causes
some little trouble, when we are using that part of the tables wherein $p$ is only given to the unit;
further we have to remember that $I_x(p, q)$ is only tabled for $p > q$, so that we must find, when $q$ is $> p$,
$I_x(p, q)$ from the relation $I_x(p, q) = 1 - I_{1-x}(q, p)$. The requisite formula is the following, where
$\phi = 1 - \theta$ and $d$, the tabulating interval of $x$, is here unity:

$$I_{x+\theta d}(p + \tfrac{1}{2},\ p + \tfrac{1}{2}) = \tfrac{1}{4}\phi\,\{I_x(p, p) + 1 - I_{1-x}(p + 1, p) + I_x(p + 1, p + 1) + I_x(p + 1, p)\}$$
$$\qquad + \tfrac{1}{4}\theta\,\{I_{x+d}(p, p) + 1 - I_{1-x-d}(p + 1, p) + I_{x+d}(p + 1, p + 1) + I_{x+d}(p + 1, p)\}.$$

Here all the incomplete B-function ratios will be found in the tables, as $p$ is an integer.





TABLE XI.

P_λn Test for Tippett's Data.

 Index     Size of     Correlation                             Value of p_nt
 Number    Sample      r_t           B-function            Numerical    Logarithm
   t       n_t                                             Value
   1         66        +.089         I_.5445 (32, 32)       .761        1̄.881,3847
   2         77        +.182         I_.5910 (37.5, 37.5)   .946        1̄.975,8911
   3         53        -.093         I_.4535 (25.5, 25.5)   .255        1̄.406,5402
   4         60        -.185         I_.4075 (29, 29)       .079        2̄.897,6271
   5         32        +.217         I_.6085 (15, 15)       .117        1̄.068,1859
   6         39        -.255         I_.3725 (18.5, 18.5)   .060        1̄.778,1513
   7         44        +.002         I_.5010 (21, 21)       .505        1̄.703,2914
   8         19        -.302         I_.3490 (8.5, 8.5)     .1045       1̄.019,1163
   9         32        -.251         I_.3745 (15, 15)       .083        2̄.919,0781
  10         34        -.147         I_.4265 (16, 16)       .203        1̄.307,4960
  11         47        -.021         I_.4895 (22.5, 22.5)   .445        1̄.648,3600
  12         80        -.198         I_.4010 (39, 39)       .039        2̄.591,0646
  13        152        -.067         I_.4665 (75, 75)*      .2075*      1̄.317,0181

log10 λ_n = S(log10 p_nt) = -8.486,7952, √13 log10 e = 1.565,8711, u = 5.41986.

Chance of more improbable sets = 1 - I(12, 5.41986) = .048.

Tippett applying a normal curve of standard deviation $1/\sqrt{n - 3}$ instead of (30)
finds a $\chi^2$ for 13 lying between 17 and 18 (17.26) which leads to a P = .141, or
between the .1 and .2 levels. It would thus appear that our test is more stringent
than that applied by Tippett, or the difference may be due to the approximate
nature of the method used by him. Not being a very enthusiastic advocate of .02
as a fit measure for rejection of randomness, I am inclined to doubt whether .048 is
to be taken as sufficient evidence that the series of correlation coefficients are
random samples of populations with zero coefficients of correlation.

Illustration 10. Comparison of two Hypotheses. We have already noted that,
apart from the replacement of binomials by normal curves, the $(\chi^2, P)$ test suffers under
the disadvantage that it gives the same resulting probability wherever the
individuals may be in the same set of subranges. The $(\lambda_n, P_{\lambda_n})$ test allows for this,
and accordingly is far better suited for answering the problem of whether a
Hypothesis A is or is not more probable than a Hypothesis B.

We will illustrate such a comparison by Tippett's data in Illustration 9. In
that illustration we have taken Tippett's hypothesis that the data are random
samples from parent populations having zero correlations. This shall be Hypothesis A.
We have seen that the most likely value of the correlation is not zero, but a correlation
measured by -.06228. We will ask what is the probability that the thirteen samples
were drawn from parent populations with the correlation of the variates measured
by the coefficient -.06228. This is our Hypothesis B.

* Obtained approximately from corresponding normal curve. 





To answer the problem requires us to determine the probability integrals for
$\rho = -.06228$ of thirteen samples ranging from size 19 to size 152. The tables of
the probability integral of $r$ as sampled from normal distributions are not yet
sufficiently advanced to cover this wide field. Only the sample of 19 falls within
the present range of those tables. It seemed best to adopt a uniform process for
all cases, and accordingly the following method was used. It is known that from
about $n = 20$ onwards an excellent fit to the distribution curve of $r$ is obtained by
aid of a Pearson curve* having the same first four moment coefficients† as the
$r$-distribution. The resulting curves belong to Type I:

$$y = y_0\,(a_1 + x)^{m_1}\,(a_2 - x)^{m_2} \qquad (32),$$

and accordingly the probability integrals may all be found from the Tables of
the Incomplete B-Function. In order to reduce these curves we need to find:
(i) $m_1$, $m_2$, these are found from the $\beta_1$ and $\beta_2$, (ii) the range $b$, this is found
from $\mu_2$, $\beta_1$ and $\beta_2$, and (iii) the distance $d$ of the observed correlation coefficient from
the start of the Pearson curve. This must be reduced to $d'$ by dividing by the
range $b$, for entry into the B-function table.

In determining the incomplete B-function ratio we must remember that as
$m_2$ is always greater than $m_1$ we must look up in the tables not $I_{d'}(m_1', m_2')$ but
its equivalent $1 - I_{1-d'}(m_2', m_1') = p_n$. The probability integral thus obtained is
the probability of the occurrence of samples with a greater improbability than the
observed sample. Table XII gives the values of the constants of the curves, the
corresponding $p_n$'s and their logarithms and finally the value of $u$ with which we
enter the Incomplete Γ-Function Table. This gives by linear interpolation:

$$P_{\lambda_n} = 1 - I(12,\ 3.674{,}963) = 1 - .5640 = .4360.$$

Thus by the hypothesis that Cephalic Index and Upper Face Index have a 
correlation coefficient equal to the most probable value provided by the whole set 
of experiences we obtain a probability rather more than nine times as great as that 
provided by the hypothesis that the correlation coefficient is really zero. To those 
who have had experience of the correlation between cranial characters in long series, 
the fact that small correlations between such may be significant is familiar. There
is accordingly no ground for assuming that because we have four positive and nine 
negative small correlation coefficients it is a reasonable hypothesis that the correlation 
coefficient between these two characters is zero. 

But here I reach my main criticism of the method now frequently adopted for
testing hypotheses. Arbitrary values like P = .01 or P = .02 are taken to indicate
the improbability of a hypothesis. A hypothesis is then found to have a P = .04
or .06, and it is stated to be thus shown to be reasonable. A conclusion is then
drawn from the hypothesis, which is taken as a physical principle, for example
that no difference exists between two populations, or that a correlation coefficient
is zero. No attention is paid to the fact that another hypothesis may prove more

* Biometrika, Vol. xi. pp. 382-386, or Tables for Statisticians, Part II. pp. clvii-clxi.

† Biometrika, Vol. xi. pp. 387-388, or Tables for Statisticians, Part II. p. clxii.



TABLE XII. Probability Integrals for Tippett's Series on the Hypothesis that ρ = -.06228.

√13 log10 e = 1.565,8711, u = 3.674,963. S(log10 p_n) = log10 λ_n = -5.754,5178.

The frequency is too great to bring this case into the range of the Incomplete B-Function Table, and accordingly the normal curve was used.






stringent* and indicate a difference between two populations, or show that it is 
more reasonable to suppose the correlation coefficient not zero. 

Illustration 11. Comparison of two Hypotheses. I will take a further illustration
of such a comparison which will also cast more light on the difficulties I feel with 
regard to small samples. 

The following thirty observations given in column ( b ) of Table XIII are a 
random sample and we take as our Hypothesis A that the parent population had 

TABLE XIII.

Test for Normality in Parent Population from Sample.

  (a)      (b)            (c)           (d)         (e)        (f)
 Index   Observation    Deviation      x_t'/Σ      p_t†      log10 p_t
  No.       x_t         from Mean
                          x_t'
   1        25          -419.5        -1.485       .0688     2̄.837,5884
   2       550          +105.5        + .373       .6454     1̄.809,8290
   3       517          + 72.5        + .257       .6014     1̄.779,1634
   4        33          -411.5        -1.457       .0726     2̄.860,3380
   5       210          -234.5        - .830       .2033     1̄.308,1374
   6       641          +196.5        + .696       .7568     1̄.878,9811
   7       477          + 32.5        + .115       .5458     1̄.737,0335
   8       318          -126.5        - .448       .3271     1̄.514,6805
   9       418          - 26.5        - .094       .4626     1̄.665,2056
  10       277          -167.5        - .593       .3766     1̄.575,8803
  11       532          + 87.5        + .310       .6217     1̄.793,5809
  12       466          + 21.5        + .076       .5303     1̄.724,5216
  13       671          +226.5        + .802       .7887     1̄.896,9118
  14       595          +150.5        + .533       .7030     1̄.846,9553
  15       152          -292.5        -1.035       .1503     1̄.176,9590
  16       988          +543.5        +1.924       .9728     1̄.988,0236
  17       420          - 24.5        - .087       .4653     1̄.667,7331
  18       625          +180.5        + .639       .7386     1̄.868,4093
  19       389          - 55.5        - .196       .4333     1̄.636,7887
  20       171          -273.5        - .968       .1665     1̄.221,4142
  21       968          +523.5        +1.853       .9681     1̄.985,9202
  22       728          +283.5        +1.004       .8423     1̄.925,4668
  23       949          +504.5        +1.786       .9629     1̄.983,5812
  24       178          -266.5        - .943       .1728     1̄.237,5437
  25       120          -324.5        -1.149       .1253     1̄.097,9511
  26       144          -300.5        -1.064       .1437     1̄.157,4568
  27        37          -407.5        -1.443       .0745     2̄.872,1563
  28       944          +499.5        +1.768       .9615     1̄.982,9493
  29       289          -155.5        - .550       .2912     1̄.464,1914
  30       503          + 58.5        + .207       .5820     1̄.764,9230

Mean = 444.5, Σ = 282.49328, log10 λ_n = -12.739,7252.

1/(√30 log10 e) = .4203,9259,67, I(n - 1, u) = I(29, 5.3556,8616) = .47551.

* For illustrations in degrees of stringency of different tests, see Biometrika, Vol. xxiv. pp. 306 et seq.

† Found by linear interpolation from Sheppard's Tables.







a normal distribution. The mean $M$ of the observations is $M = 444.5$ and their
standard deviation $\Sigma = 282.49328$. We will adopt these as probable values in the
parent population. Column (c) of Table XIII gives the deviations from the mean;
column (d) expresses these in terms of the standard deviation, column (e) gives the
probability integrals $p_t$, and column (f) their logarithms, the sum of which is
-12.739,7252. Accordingly since $1/(\sqrt{n}\,\log_{10} e) = .4203{,}9259{,}67$, we require to find
$I(29,\ 5.3556{,}8616)$ from the Table of the Incomplete Γ-Function. Interpolating by
means of $\delta^2$ (i.e. to third difference accuracy), we have

$I(29,\ 5.3556{,}8616) = .47551$, and $P_{\lambda_n} = .52449$.

Thus between 52% and 53% of samples from the above mentioned normal
population would have a less degree of probability than the observed sample.
Shall we therefore assume it "reasonable" to suppose the sample drawn from a
normal population? I fear many statisticians will say that it is, and not hesitate
to draw any inferences that may be based on such an assumption. I doubt, however,
whether the use of the word "reasonable" is proper in a case of this kind.
There are possibly far more probable hypotheses as to the nature of the parent
population, which might lead us to base very different conclusions on the nature
of the sample, for example that the range of possible observations was narrowly
limited, or that the frequency of the parent population was not such that small or
large values of the observations occurred with relatively small frequency. We will
take a second hypothesis, Hypothesis B, that the sample has been drawn from
a rectangular population. The maximum range as shown by the sample
= 988 - 25 = 963. This is most probably the modal range of samples from the
parent population $= (n - 2)\,b/(n - 1)$*, where $b$ is the range of the parent
population. Hence $b = \tfrac{29}{28} \times 963 = 997.39286$. This seems a good value to take for
the range of the parent population. Table XIV, column (b) gives the observations;
(c) the probability integrals; (d) their logarithms, with their sum, leading to the
incomplete Γ-function $I(29,\ 6.2168{,}5177)$, the evaluation of which by $\delta^2$
interpolation from the Incomplete Γ-Function Tables is equal to .77905, and

$P_{\lambda_n} = .22095$.

Accordingly almost 22% of samples of sets of observations from a rectangular
parent of the above range would be more improbable than the observed set, and
the Hypothesis A is seen to be much more probable than Hypothesis B, although
if Hypothesis B had been first tried and its probability, .22095, computed, many
statisticians would have been content with its "reasonableness," and not have
proceeded further. Now the strange fact is that the observations were actually
taken as the first three figures of the first six sets of five on sheet XXIV of
Tippett's Random Sampling Numbers, and may therefore be supposed to form a
random sample of 30 from a rectangular parent population of range 1000. Our
new method should enable us fairly readily to compare the probability of different
hypotheses. But the main point to be noted is that because one hypothesis has a

* Biometrika, Vol. xxiii. p. 394.





TABLE XIV.

Test for Rectangularity of Parent Population from Sample.

  (a)     (b)        (c)           (d)             (a)     (b)        (c)           (d)
 Index  Observa-   p_t = (b)    log10 p_t         Index  Observa-   p_t = (b)    log10 p_t
  No.    tion      ÷ 997.39286                     No.    tion      ÷ 997.39286
   1      25        .0251       2̄.399,6737         16     988        .9906       1̄.995,8983
   2     550        .5514       1̄.741,4668         17     420        .4211       1̄.624,3852
   3     517        .5184       1̄.714,6650         18     625        .6266       1̄.796,9904
   4      33        .0331       2̄.519,8280         19     389        .3900       1̄.591,0646
   5     210        .2105       1̄.323,2521         20     171        .1714       1̄.234,0108
   6     641        .6427       1̄.808,0083         21     968        .9705       1̄.986,9955
   7     477        .4782       1̄.679,6096         22     728        .7299       1̄.863,2634
   8     318        .3188       1̄.503,5183         23     949        .9515       1̄.978,4088
   9     418        .4191       1̄.622,3177         24     178        .1785       1̄.251,6382
  10     277        .2777       1̄.443,5759         25     120        .1203       1̄.080,2656
  11     532        .5334       1̄.727,0530         26     144        .1444       1̄.159,5672
  12     466        .4672       1̄.669,5028         27      37        .0371       2̄.569,3739
  13     671        .6728       1̄.827,8860         28     944        .9465       1̄.976,1206
  14     595        .5966       1̄.775,6832         29     289        .2898       1̄.462,0984
  15     152        .1524       1̄.182,9850         30     503        .5043       1̄.702,6890

p_t values retained only to four places. log10 λ_n = -14.788,2047.

u = -log10 λ_n / (√n log10 e) = 6.2168,5177, I(n - 1, 6.2168,5177) = .77905*.

very considerable probability and a higher probability than a second, it does not
follow that it is reasonable to suppose that hypothesis to hold and another to be
false, and hence draw conclusions from the former holding†.
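The comparison of the two hypotheses for the thirty observations can be retraced from the raw figures. The following is a sketch under stated assumptions rather than a reproduction of the tabular work: the function names are hypothetical, the normal probability integrals are computed from the error function instead of Sheppard's Tables, the standard deviation uses the divisor $n$, and $1 - I(n-1, u)$ is evaluated as the finite sum valid for integer $n$. Because the sketch works from unrounded values, small departures from the printed figures are to be expected.

```python
import math

OBSERVATIONS = [25, 550, 517, 33, 210, 641, 477, 318, 418, 277,
                532, 466, 671, 595, 152, 988, 420, 625, 389, 171,
                968, 728, 949, 178, 120, 144, 37, 944, 289, 503]

def p_lambda(probabilities):
    """1 - I(n - 1, u) for a set of n probability integrals."""
    n = len(probabilities)
    x = -sum(math.log(p) for p in probabilities)   # x = -ln(lambda_n)
    term, total = 1.0, 1.0
    for k in range(1, n):                          # builds sum_{k < n} x^k / k!
        term *= x / k
        total += term
    return math.exp(-x) * total

def normal_integrals(xs):
    """Hypothesis A: normal parent with the sample mean and s.d. (divisor n)."""
    n = len(xs)
    mean = sum(xs) / n
    sd = math.sqrt(sum((x - mean) ** 2 for x in xs) / n)
    return [0.5 * (1.0 + math.erf((x - mean) / (sd * math.sqrt(2.0)))) for x in xs]

def rectangular_integrals(xs):
    """Hypothesis B: rectangular parent from zero to b = (n-1)/(n-2) x sample range."""
    n = len(xs)
    b = (n - 1) / (n - 2) * (max(xs) - min(xs))
    return [x / b for x in xs]

p_normal = p_lambda(normal_integrals(OBSERVATIONS))
p_rect = p_lambda(rectangular_integrals(OBSERVATIONS))
print(round(p_normal, 3), round(p_rect, 3))
```

On these data the rectangular hypothesis comes out near .22 and the normal hypothesis near one half, so the ordering of the two hypotheses discussed in the text is reproduced.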

The effect of using small samples is to render it quite probable (probability
= .50 about!) that a sample was drawn from a population differing very widely
from the population out of which it was actually extracted. In short the fact that
a hypothesis is even very probable in the case of a small sample by no means
demonstrates that it is a "reasonable" hypothesis, and that accordingly inferences
may be drawn from it‡. All in fact that the present test and other tests besides

* If b be found from … it equals 978.592,866 and I = .75012. But this b is less than the
observation 988 and gives one probability value = 1.0096, i.e. > 1!

† The greater probability of the normal hypothesis is here explicable because we have two constants
to dispose of, while in the case of the rectangle, we have only made use of the range, measuring it from
the zero of observation. It is not very easy, short of approximations of considerable length, to determine
the best "centre" and the best range of a rectangular population from a given sample, and clearly
any attempt to do so moves us away from the true parent population.

‡ I have elsewhere (Biometrika, Vol. xxiv. p. 371) indicated that given two parent populations as
divergent in distribution as the normal and rectangular, it is not possible to deny the truth of one or
other hypothesis unless the sample approaches 100 to 150 in magnitude.





can achieve if several hypotheses are found to have considerable probability
is to test their relative reasonableness, and even this may deceive us, as just
exemplified.

Conclusions. 

(i) A very general test, the P^ test, has been discussed which seems to the 
writer to involve fewer approximations and assumptions than the P*a test. He would 
emphasise its advantages in this respect in the case of small samples, where it 
appears to him that the application of the P*« test may well lead to erroneous con- 
clusions, for it fails in stringency. 

(ii) The P Kn is not claimed to be a test of maximum stringency, but as having 
a very wide field of application, especially when the constants of unknown parent 
populations are given their most probable values. The method which proceeds 
from these values seems to him as effective as attempting to find tests, which involve 
only sample values. 

(iii) The Pa* test involves determining probability integrals, but tables of such 
integrals are now largely available and more will shortly be published. 

(iv) It appeals first to the principle of independent probabilities, to ascertain 
the probability of more improbable individual occurrences, and then starting from 
this probability measures the probability of all sets of occurrences, — not necessarily 
greater in each individual variate but more improbable as a whole set. 

(v) A number of illustrations are provided to indicate the breadth of the 
method, and in particular its value in the comparison of hypotheses. 

(vi) The writer endeavours to emphasise a point which has, he thinks, not always
been sufficiently regarded, namely that because a set of occurrences is found on
a selected hypothesis $H_1$ not to be very improbable by test A, it does not follow
that that hypothesis may be safely regarded as applying to the occurrences. A more
stringent test B may show $H_1$ to be very improbable, or either or both tests, A and
B, may show another hypothesis $H_2$ to be far more probable. In other words a test
may suffice to allow us reasonably to reject a hypothesis, but only rarely (and
generally in the cases where there is large previous experience) justifies us in
accepting the hypothesis as a rule of conduct, or as a mode of extracting further
information from our data.

(vii) Lastly, emphasis should be laid on the point that while probability
integrals for a given investigation should all be measured in one direction, that
direction may initially be either direction. In other words, a very high $P_{\lambda_n}$ is
calculated to arouse our suspicion as well as a very low $P_{\lambda_n}$. Really this warning
needs to be borne in mind with nearly all tests, in particular with the $P_{\chi^2}$ test.
$P_{\chi^2}$ is a probability integral of the $\chi^2$ curve measured in a particular direction, but
there is no more valid reason for initially measuring it in one and not the opposite
direction, than in the case of the normal curve probability integral, and, when the



size of the sample is not too small, $\chi^2$ approaching zero, or $P_{\chi^2}$ approaching unity,
is as definite a warning as $\chi^2$ very large and $P_{\chi^2}$ approaching zero. These
considerations are equally valid when we consider $P_{\lambda_n}$, which should approach the
value .5 with increasing size of sample if the probability integrals are truly random,
but marked deviation either way from this value is a warning that something is
improbable either in the sampling or in the hypothesis from which the probability
integrals have been deduced.

The present paper would have been impossible without the use of the Incomplete
B- and $\Gamma$-Function Tables. The author has gratefully to acknowledge the aid in
computing work of Miss F. N. David under a grant from the Department of
Scientific and Industrial Research.

Note, added December 6, 1933. 

After this paper had been set up Dr Egon S. Pearson drew my attention to
Section 21.1 in the Fourth Edition of Professor R. A. Fisher's Statistical Methods
for Research Workers, 1932. Professor Fisher is brief, but his method is essentially
what I had thought to be novel. He uses, however, a $\chi^2$ method, not my incomplete
$\Gamma$-function solution; this explains the relation referred to in the footnote on p. 383
of my paper. As my paper was already set up and illustrates, more amply than
Professor Fisher's two pages, some of the advantages and some of the difficulties
of the new method, which may be helpful to students, I have allowed it to stand.

K. P. 
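
The relation between the two forms of solution mentioned in the Note may be illustrated numerically. The sketch below rests on an assumption not spelled out in the passage above, namely that the criterion is the product $\lambda_n = p_1 p_2 \ldots p_n$ of n independent probability integrals; on that assumption $-\log \lambda_n$ follows a $\Gamma$-curve of index n, and $-2 \log \lambda_n$ a $\chi^2$ curve with 2n degrees of freedom. The function names and the sample values are illustrative only.

```python
import math

def p_lambda(probabilities):
    """Chance that a product of n random probability integrals is <= the observed one.

    Assumption: the criterion is lambda_n = p_1 * ... * p_n. Then s = -log(lambda_n)
    follows a Gamma curve of index n, whose tail has the closed form
    exp(-s) * sum_{k < n} s**k / k!.
    """
    n = len(probabilities)
    s = -sum(math.log(p) for p in probabilities)
    term, total = 1.0, 1.0
    for k in range(1, n):
        term *= s / k          # builds s**k / k! step by step
        total += term
    return math.exp(-s) * total

def fisher_chi2(probabilities):
    """The same evidence in chi-squared form: (-2 log lambda_n, degrees of freedom)."""
    chi2 = -2.0 * sum(math.log(p) for p in probabilities)
    return chi2, 2 * len(probabilities)

if __name__ == "__main__":
    sample = [0.21, 0.64, 0.09, 0.48]   # made-up probability integrals
    print(p_lambda(sample), fisher_chi2(sample))
```

On this reading a value of `p_lambda` close to unity deserves the same suspicion as one close to zero, in the sense of point (vii).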



MISCELLANEA. 


(i) The Distribution of $\beta_2$ in samples of 4 from a Normal Universe.

By A. T. McKAY, M.Sc. 

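The estimated value of $\beta_2$ is the statistic

$$\beta_2 = \frac{4\,\Sigma (x_r - \bar{x})^4}{\{\Sigma (x_r - \bar{x})^2\}^2} \qquad (1).$$

Let us employ the orthogonal transformations

$$\begin{aligned}
2x_1 &= \phantom{-}y_1 + y_2 + y_3 + y_4,\\
2x_2 &= -y_1 + y_2 - y_3 + y_4,\\
2x_3 &= -y_1 - y_2 + y_3 + y_4,\\
2x_4 &= \phantom{-}y_1 - y_2 - y_3 + y_4
\end{aligned} \qquad (2),$$

noting that $\sum_1^4 x_r^2 = \sum_1^4 y_r^2$, $\sum_1^4 (x_r - \bar{x})^2 = \sum_1^3 y_r^2$, $\bar{x} = \tfrac{1}{2} y_4$, $2(x_1 - \bar{x}) = \sum_1^3 y_r$, etc., then

$$16\,\Sigma (x_r - \bar{x})^4 = \{(y_1 + y_2 + y_3)^4 + (y_1 + y_3 - y_2)^4 + (y_1 + y_2 - y_3)^4 + (y_2 + y_3 - y_1)^4\} \qquad (3).$$

Since the expression on the right-hand side of (3) is cyclical and unaltered by a change of
sign of any variable, we may infer that

$$16\,\Sigma (x_r - \bar{x})^4 = A\,(y_1^2 + y_2^2 + y_3^2)^2 + B\,(y_1^2 y_2^2 + y_2^2 y_3^2 + y_3^2 y_1^2) \qquad (4).$$

Giving suitable values to the variables, we can readily find A = 4, B = 16, whence

$$\tfrac{1}{4}(\beta_2 - 1) = w = \frac{y_1^2 y_2^2 + y_2^2 y_3^2 + y_3^2 y_1^2}{(y_1^2 + y_2^2 + y_3^2)^2} \qquad (5).$$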
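
The reduction of (1) to (5) may be checked on arbitrary samples. The sketch below takes (5) to read $\beta_2 = 1 + 4w$ and uses one orthogonal transformation with $y_4 = 2\bar{x}$; the particular arrangement of signs is an assumption, since any arrangement of this kind leads to the same result. The function names are illustrative.

```python
import random

def beta2(x):
    """b2 = n * sum(d^4) / (sum(d^2))^2, d being deviations from the sample mean."""
    n = len(x)
    mean = sum(x) / n
    d2 = sum((v - mean) ** 2 for v in x)
    d4 = sum((v - mean) ** 4 for v in x)
    return n * d4 / d2 ** 2

def to_y(x):
    """An orthogonal transformation of four values with y4 = 2 * mean (assumed signs)."""
    x1, x2, x3, x4 = x
    return (
        (x1 - x2 - x3 + x4) / 2,
        (x1 + x2 - x3 - x4) / 2,
        (x1 - x2 + x3 - x4) / 2,
        (x1 + x2 + x3 + x4) / 2,
    )

def w_from_y(y):
    """Right-hand side of (5), which involves y1, y2, y3 only."""
    a, b, c = y[0] ** 2, y[1] ** 2, y[2] ** 2
    return (a * b + b * c + c * a) / (a + b + c) ** 2

if __name__ == "__main__":
    random.seed(4)
    sample = [random.gauss(0.0, 1.0) for _ in range(4)]
    print(beta2(sample), 1 + 4 * w_from_y(to_y(sample)))
```

The two printed values should agree to rounding error, whatever sample of four is supplied.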

Now since the expression on the right-hand side of (1) has four variables but only three
degrees of freedom, we should be able to ascertain the range of fluctuation of $\beta_2$ by considering
the expression (6). Write $y_1^2 = X$, $y_2^2 = Y$, $y_3^2 = Z$; then

$$w = \frac{XY + YZ + ZX}{(X + Y + Z)^2} \qquad (6).$$

Differentiating partially with respect to each of the three variables in turn and equating to
zero, we derive

$$\left.\begin{aligned}
Y(Y - X) + Z(Z - X) &= 0\\
X(X - Y) + Z(Z - Y) &= 0\\
X(X - Z) + Y(Y - Z) &= 0
\end{aligned}\right\} \qquad (7),$$

from which it follows that there is a turning value when X = Y = Z. Further, since

$$\frac{\partial^2 w}{\partial X^2} = \frac{\partial^2 w}{\partial Y^2} = \frac{\partial^2 w}{\partial Z^2} = \frac{-2}{27a^2} \qquad (8)$$

when X = Y = Z = a, we can conclude that

$$0 \le w \le \tfrac{1}{3} \qquad (9),$$

irrespective of the character of the parent universe.
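
The limits just found correspond, through $\beta_2 = 1 + 4w$, to $1 \le \beta_2 \le 7/3$. A short numerical illustration follows; the two extreme samples and the choice of an exponential parent are examples chosen here, not taken from the text.

```python
import random

def beta2(x):
    """b2 = n * sum(d^4) / (sum(d^2))^2 for a sample x."""
    n = len(x)
    mean = sum(x) / n
    d2 = sum((v - mean) ** 2 for v in x)
    d4 = sum((v - mean) ** 4 for v in x)
    return n * d4 / d2 ** 2

# Two equal pairs give the lower limit; three equal values and one
# outlying value give the upper limit.
at_lower = beta2([1, 1, -1, -1])
at_upper = beta2([3, -1, -1, -1])

# Random samples of 4 from two very different parents stay inside the limits.
random.seed(1933)
normal_values = [beta2([random.gauss(0, 1) for _ in range(4)]) for _ in range(5000)]
skew_values = [beta2([random.expovariate(1.0) for _ in range(4)]) for _ in range(5000)]

if __name__ == "__main__":
    print(at_lower, at_upper)
    print(min(normal_values), max(normal_values))
    print(min(skew_values), max(skew_values))
```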

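Let us now return to our main problem. We see from (5) that it is necessary to integrate

$$\frac{1}{(2\pi)^{3/2}}\, e^{-\frac{1}{2}(y_1^2 + y_2^2 + y_3^2)}\, dy_1\, dy_2\, dy_3 \qquad (10)$$

over a triply infinite field conditioned by

$$w < \frac{y_1^2 y_2^2 + y_2^2 y_3^2 + y_3^2 y_1^2}{(y_1^2 + y_2^2 + y_3^2)^2} < w + \delta w \qquad (11).$$

Transform to polar coordinates by writing $y_1 = r\cos\theta$, $y_2 = r\sin\theta\cos\phi$, $y_3 = r\sin\theta\sin\phi$; hence
equation (10) becomes

$$\frac{1}{(2\pi)^{3/2}}\, \sin\theta\, d\theta\, d\phi\; r^2 e^{-r^2/2}\, dr \qquad (12),$$

and condition (11) reduces to

$$w < \sin^2\theta\cos^2\theta + \sin^4\theta\,\sin^2\phi\cos^2\phi < w + \delta w \qquad (13),$$

where $0 \le \theta \le \pi$ and $0 \le \phi \le 2\pi$.

Integrating out the r-term in (12) and writing $\cos\theta = x$, $2\phi = \Phi$ in (12) and (13), we may
derive $(2/\pi)\, dx\, d\Phi$ in place of (12) and

$$w < x^2(1 - x^2) + \tfrac{1}{4}(1 - x^2)^2 \sin^2\Phi < w + \delta w \qquad (14),$$

where now $0 \le x \le 1$ and $0 \le \Phi \le \pi/2$ indicate the limits of the field of integration which is to be
conditioned by the last inequality. The moments of the distribution of w about the origin are
thus given by

$$\nu_k = \frac{2}{\pi} \int_0^1\!\!\int_0^{\pi/2} \{x^2(1 - x^2) + \tfrac{1}{4}(1 - x^2)^2 \sin^2\Phi\}^k\, dx\, d\Phi \qquad (15).$$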

By expanding the integrand by the binomial theorem and performing the integration term
by term, we find

M l $T(k-%)k(k-l)T(k+3) , \ 

+ 2‘2 ‘ 4 a (2 !) a + ’*‘J K 

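Hence $\nu_1 = \tfrac{1}{5}$, $\nu_2 = \tfrac{1}{21}$, $\nu_3 = \tfrac{61}{5005}$, $\nu_4 = \tfrac{277}{85085}$, so that if $\mu$ refers to the moments of $\beta_2$ about the
mean, $\mu_2 = 0.12190$, $\mu_3 = -0.02455$, $\mu_4 = 0.034276$, $\beta_1 = 0.33273$, $\beta_2 = 2.30645$.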
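
The term-by-term integration can be carried through exactly in rational arithmetic. The sketch below assumes that (15) reads $\nu_k = (2/\pi)\int_0^1\int_0^{\pi/2}\{x^2(1-x^2) + \tfrac{1}{4}(1-x^2)^2\sin^2\Phi\}^k\,dx\,d\Phi$ and that $\beta_2 = 1 + 4w$; it uses the known mean value of $\sin^{2j}\Phi$ over a quadrant, $\binom{2j}{j}/4^j$. The helper names are illustrative.

```python
from fractions import Fraction
from math import comb

def x_integral(m, q):
    """Exact integral of x^(2m) * (1 - x^2)^q over 0 <= x <= 1."""
    return sum(Fraction((-1) ** i * comb(q, i), 2 * m + 2 * i + 1) for i in range(q + 1))

def nu(k):
    """k-th moment of w about the origin, expanding the integrand of (15)."""
    total = Fraction(0)
    for j in range(k + 1):
        # (2/pi) * integral of sin^(2j) over (0, pi/2)
        sine_mean = Fraction(comb(2 * j, j), 4 ** j)
        total += comb(k, j) * Fraction(1, 4 ** j) * sine_mean * x_integral(k - j, k + j)
    return total

def beta2_moments():
    """Moments of beta_2 = 1 + 4w about its mean, with the two shape coefficients."""
    n1, n2, n3, n4 = (nu(k) for k in (1, 2, 3, 4))
    mu2 = 16 * (n2 - n1 ** 2)
    mu3 = 64 * (n3 - 3 * n1 * n2 + 2 * n1 ** 3)
    mu4 = 256 * (n4 - 4 * n1 * n3 + 6 * n1 ** 2 * n2 - 3 * n1 ** 4)
    return mu2, mu3, mu4, mu3 ** 2 / mu2 ** 3, mu4 / mu2 ** 2

if __name__ == "__main__":
    print([nu(k) for k in (1, 2, 3, 4)])
    print([float(v) for v in beta2_moments()])
```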

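Returning now to (15) and the remarks which precede it, we conclude that if $\phi(w)$ is the
distribution which is being sought, then

$$\phi(w) = \frac{2}{\pi} \int \frac{d\Phi}{dw}\, dx = \frac{1}{\pi} \int \frac{dx}{\{(w - x^2 + x^4)(\tfrac{1}{4}(1 - x^2)^2 - w + x^2 - x^4)\}^{1/2}} \qquad (17),$$

where the limits of the integral are such that x runs from 0 to 1 subject to the condition

$$0 \le \frac{w - x^2 + x^4}{(1 - x^2)^2} \le \tfrac{1}{4} \qquad (18).$$

In order to find the appropriate limits, we consider the family of curves defined by

$$\lambda = \frac{w - t + t^2}{(1 - t)^2} \qquad (19)$$

and shown in the following diagram. We note from Equation (18) that only values of $\lambda$ in the
unshaded part of the diagram are consistent with our requirements.

There are plainly two cases to consider.

Case (i). $0 \le w \le \tfrac{1}{4}$.

Let the curve marked A be regarded as a typical curve for this case. Then there are two
independent ranges which satisfy our conditions, i.e.

(1) $0 \le t \le t_1$

and (2) $t_2 \le t \le t_4$.

Case (ii). $\tfrac{1}{4} \le w \le \tfrac{1}{3}$.

Here we take B as a typical curve, which shows that there is only one possible range, viz.

(3) $t_3 \le t \le t_4$.

By writing $\lambda = 0$ and $\lambda = \tfrac{1}{4}$ in Equation (19), we readily find

$$\left.\begin{aligned}
t_1,\ t_2 &= \tfrac{1}{2}\,(1 \mp \sqrt{1 - 4w})\\
t_3,\ t_4 &= \tfrac{1}{3}\,(1 \mp 2\sqrt{1 - 3w})
\end{aligned}\right\} \qquad (20).$$

If now in (17) we change the variable by writing $x^2 = t$, we conclude

$$\phi(w) = \frac{1}{2\pi} \left( \int_0^{t_1} + \int_{t_2}^{t_4} \right) F(t)\, dt, \qquad 0 \le w \le \tfrac{1}{4} \qquad (21)$$

$$\phantom{\phi(w)} = \frac{1}{2\pi} \int_{t_3}^{t_4} F(t)\, dt, \qquad \tfrac{1}{4} \le w \le \tfrac{1}{3} \qquad (22),$$

where $F(t) = \{t\,(w - t + t^2)\,(\tfrac{1}{4}(1 - t)^2 - w + t - t^2)\}^{-1/2}$ (23).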




Let us now make the substitution

$$t = \tfrac{1}{3}\{1 + 2\sqrt{1 - 3w}\, \sin\theta\} \qquad (24),$$


then 


dt 


F (<) — -6 {(9m> - 2) - 2 ( 1 - 3u>) 8 sin 36 f 8 (26). 

do 


when (26) 

when (27). 


Hence writing 4 (1 — 3w) sin a when 0 < u> £ i , we find 

Substituting $3\alpha = \pi/2 + \beta$, and noting that $\sin 3\alpha = \tfrac{1}{2}(2 - 9w)/(1 - 3w)^{3/2}$, we deduce

— * P * , 

Jr (1 - 3w)* J o (2 cos £ - 2 cos £)* 

B i rff 

n J o {(9 m> - 2) + 2 (1 — 3w)* cos£} 

The first of these integrals (28) is a Mehler Integral, hence

(*• i J » i?)} 

when $0 \le w \le \tfrac{1}{4}$ and $y = (9w - 2)/\{2(1 - 3w)^{3/2}\}$.

The second integral (29) may be written 




(28) 



(29). 

TiS 

(30) 


f8ir 


d$ 


* (W) “ir(9w-2y*J o (l + (cos£)/y)* 


.(31), 


the integrand of which can be expanded by the binomial theorem and integrated term by term 
to yield 


1: y) 

We note in passing that this last equation may also be written 


when l ^ w < J (32). 


+ {w) ~nl 


•(33), 


where Q represents the Legendre Function of the second kind. 

By writing $w = \tfrac{1}{4}(\beta_2 - 1)$ in equations (30) and (32), we finally find that the distribution of
$\beta_2$ in samples of four from a normal universe is given by $f(\beta_2)$, where


(l> 4 ; * s 

'V) 

when 1<*^2... 

...(34) 

(i.i; i; 


when 2^*^ j... 

...(36), 


2(0* -17)* " 

where $g = (9\beta_2 - 17)/(7 - 3\beta_2)^{3/2}$.

The chief characteristics of this distribution are shown in the accompanying diagram. Owing 
to the extreme difficulty of calculating the values of the hyper-geometric functions this had to 
be omitted, so that the two curves have merely been sketched in. Should, however, it prove 
desirable to have an exact knowledge of the form of these curves, the procedure indicated in the 
previous paper* would most likely prove of considerable assistance. 


* Biometrika, Vol. xxv. Parts I and II (1933) (Miscellanea).



[Diagram: Sketch showing the general character of the distribution of $\beta_2$ in samples of 4 from a normal Universe. Horizontal axis: scale of $\beta_2$.]


(ii) A Note on the Distribution of Range in Samples of n.

By A. T. McKAY, M.Sc., and E. S. PEARSON, D.Sc.

In a recent paper* one of the present writers has provided a table giving certain percentage 
limits for the distribution of range in samples from a normal population. These limits were 
obtained on the assumption that the distribution could be adequately represented by Pearson 
curves having the appropriate moment coefficients. The theoretical treatment in Section (1) 
below, while following the general method of approach previously employed, leads to certain new 
results regarding the form of the range curve at the terminals, and also provides the exact 
distribution of range in the case of samples of 3 from a normal population. In this latter case, 
therefore, it makes possible a check on the accuracy of the published table. 

(1) Theoretical Treatment . 

Let $x_1, x_2, \ldots, x_n$ be a random sample of n from a universe defined in the interval $(-b, a)$ for
x by the probability function

$$y = f(x) \qquad (1).$$

* E. S. Pearson, Biometrika, Vol. xxiv. p. 416.





To find the distribution of the range, i.e. the numerical value of the difference between the
greatest and least observations in a random sample of n, we require to integrate

$$f(x_1)\, f(x_2) \ldots f(x_n)\, dx_1\, dx_2 \ldots dx_n \qquad (2)$$

over an appropriate field.

Consider any pair of values $x_1$ and $x_2$ selected from the group of n. Then the compound
probability that these two values are the extremes of the group, $x_1$ being the least and $x_2$ the
greatest, and at the same time give a value of the range lying between w and $w + \delta w$, is found
by integrating the expression (2) over all possible values of $x_1$ and $x_2$, subject to the conditions

$$\left.\begin{aligned}
&x_1 < x_r < x_2 \quad (r = 3, 4, \ldots, n)\\
\text{and}\quad &w < |x_2 - x_1| < w + \delta w
\end{aligned}\right\} \qquad (3).$$

Since, however, the pair of values $x_1$ and $x_2$ can be selected in $n(n - 1)$ different ways, the total
probability required is to be derived by multiplying the results of the integration by $n(n - 1)$.

Thus if the required distribution function is $\phi(w)$, we find

$$\phi(w)\, \delta w = n(n - 1) \iint f(x_1)\, f(x_2) \left\{ \int_{x_1}^{x_2} f(x)\, dx \right\}^{n-2} dx_1\, dx_2 \qquad (4),$$

where the limits of the integrals for $x_1$ and $x_2$ are chosen to satisfy the second condition of
Equation (3).

Let us now transform the variables by the substitutions

$$(x_2 - x_1) = u\sqrt{2}, \qquad (x_2 + x_1) = v\sqrt{2}.$$

Then condition (3) requires that we integrate throughout the shaded strip in the diagram. The
limits for u are therefore u and $u + \delta u$, while those for v are $(u - b\sqrt{2})$ and $(a\sqrt{2} - u)$. Hence,

$$\phi(w)\, \delta w = n(n - 1)\, \delta u \int_{u - b\sqrt{2}}^{a\sqrt{2} - u} f\!\left(\frac{v - u}{\sqrt{2}}\right) f\!\left(\frac{v + u}{\sqrt{2}}\right) \left\{ \int_{(v - u)/\sqrt{2}}^{(v + u)/\sqrt{2}} f(x)\, dx \right\}^{n-2} dv \qquad (5),$$

or substituting $u = w/\sqrt{2}$ and $v = t\sqrt{2}$,

$$\phi(w) = n(n - 1) \int_{-b + \frac{1}{2}w}^{a - \frac{1}{2}w} f(t + \tfrac{1}{2}w)\, f(t - \tfrac{1}{2}w) \left\{ \int_{t - \frac{1}{2}w}^{t + \frac{1}{2}w} f(x)\, dx \right\}^{n-2} dt \qquad (6),$$

where the range for w is from 0 to $(a + b)$.




Example 1. Distribution of the Range in samples of n from a Rectangular Universe*.

In this case $f(x) = \tfrac{1}{2}$, whence

$$\phi(w) = \frac{n(n - 1)}{2^n}\, w^{n-2}\, (2 - w) \qquad (7).$$

Example 2. Distribution of the Range in samples of n from a Straight Line Universe.

Take $y = f(x) = 2(1 - x)$ for $0 \le x \le 1$, and $= 0$ for $x < 0$ and $> 1$,
so that $b = 0$ and $a = 1$, then after a simple integration and reduction we find

$$\phi(w) = \frac{n(n - 1)\, w^{n-2}}{2} \left\{ \frac{2w^{n+1}}{(n^2 - 1)} + \frac{(2 - w)^{n+1}}{(n + 1)} - \frac{w^2 (2 - w)^{n-1}}{(n - 1)} \right\} \qquad (8).$$
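
Equations (6), (7) and (8) may be checked against one another by direct quadrature. The sketch below assumes (6) in the form $\phi(w) = n(n-1)\int f(t+\tfrac{1}{2}w)\,f(t-\tfrac{1}{2}w)\{\int_{t-w/2}^{t+w/2} f(x)\,dx\}^{n-2}\,dt$, places the rectangular universe on $(-1, 1)$ so that $f(x) = \tfrac{1}{2}$, and uses Simpson's rule; the function names and the choice of quadrature are illustrative.

```python
def simpson(func, lo, hi, steps=400):
    """Composite Simpson rule on [lo, hi]; steps must be even."""
    h = (hi - lo) / steps
    total = func(lo) + func(hi)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * func(lo + i * h)
    return total * h / 3

def range_density(w, n, pdf, cdf, lower, upper):
    """Equation (6) for a parent defined on (lower, upper).

    The limits of integration keep t - w/2 and t + w/2 inside the parent's
    interval, so pdf and cdf are only ever called within it.
    """
    def integrand(t):
        inner = cdf(t + w / 2) - cdf(t - w / 2)
        return pdf(t - w / 2) * pdf(t + w / 2) * inner ** (n - 2)
    return n * (n - 1) * simpson(integrand, lower + w / 2, upper - w / 2)

# Example 1: rectangular universe on (-1, 1).
def rect_pdf(x):
    return 0.5

def rect_cdf(x):
    return (x + 1.0) / 2.0

def rect_exact(w, n):
    return n * (n - 1) / 2 ** n * w ** (n - 2) * (2 - w)

# Example 2: straight-line universe f(x) = 2(1 - x) on (0, 1).
def line_pdf(x):
    return 2.0 * (1.0 - x)

def line_cdf(x):
    return 2.0 * x - x * x

def line_exact(w, n):
    return n * (n - 1) * w ** (n - 2) / 2 * (
        2 * w ** (n + 1) / (n * n - 1)
        + (2 - w) ** (n + 1) / (n + 1)
        - w ** 2 * (2 - w) ** (n - 1) / (n - 1)
    )

if __name__ == "__main__":
    print(range_density(0.7, 4, rect_pdf, rect_cdf, -1.0, 1.0), rect_exact(0.7, 4))
    print(range_density(0.4, 4, line_pdf, line_cdf, 0.0, 1.0), line_exact(0.4, 4))
```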


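Example 3. The distribution of the Range in samples of 3 from a Normal Universe.

Write $n = 3$, $f(x) = \dfrac{e^{-x^2/2}}{\sqrt{2\pi}}$ and $a = b = \infty$ in Equation (6), whence

$$\phi(w) = \frac{6\, e^{-w^2/4}}{(2\pi)^{3/2}} \int_{-\infty}^{\infty}\!\int_{t - w/2}^{t + w/2} e^{-t^2}\, e^{-x^2/2}\, dx\, dt \qquad (9).$$

Putting $x = (y + t)$ and changing the order of integration, we get

$$\phi(w) = \frac{6\, e^{-w^2/4}}{(2\pi)^{3/2}} \int_{-w/2}^{w/2}\!\int_{-\infty}^{\infty} e^{-t^2 - \frac{1}{2}(y + t)^2}\, dt\, dy \qquad (10)$$

$$\phantom{\phi(w)} = \frac{6}{\pi\sqrt{3}}\, e^{-w^2/4} \int_0^{w/2} e^{-y^2/3}\, dy = \frac{6}{\pi\sqrt{2}}\, e^{-w^2/4} \int_0^{w/\sqrt{6}} e^{-u^2/2}\, du \quad (\text{writing } u = \sqrt{\tfrac{2}{3}}\, y) \qquad (11).$$

The kth moment, $\mu'_k$, of this distribution about the origin is

$$\mu'_k = \frac{6}{\pi\sqrt{2}} \int_0^{\infty}\!\int_0^{w/\sqrt{6}} w^k\, e^{-w^2/4}\, e^{-u^2/2}\, du\, dw \qquad (12).$$

Writing $v = u/w$, changing the order of integration and integrating for w in the last equation,
we find

$$\mu'_k = \frac{6\, \Gamma(\tfrac{1}{2}k + 1)}{\pi\sqrt{2}} \times 2^{k+1} \int_0^{1/\sqrt{6}} \frac{dv}{(1 + 2v^2)^{\frac{1}{2}(k + 2)}} \qquad (13).$$

Substituting $v = (1/\sqrt{2}) \tan\theta$ in the latter integral, we finally derive

$$\mu'_k = \frac{3}{\pi} \times 2^{k+1}\, \Gamma(\tfrac{1}{2}k + 1) \int_0^{\pi/6} \cos^k\theta\, d\theta \qquad (14).$$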


To obtain some idea of the character of the range curve in the neighbourhood of its terminals,
for the case in which the parent universe extends from $-\infty$ to $+\infty$, we can approximate as
follows.

Supposing w is very large, the term

$$\left\{ \int_{t - w/2}^{t + w/2} f(x)\, dx \right\}^{n-2}$$


* This has already been given by J. Neyman and E. S. Pearson, Biometrika, Vol. xxA. p. 210.





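of Equation (6) tends to unity, thus we infer that

$$\phi(w) \simeq n(n - 1) \int_{-\infty}^{\infty} f(t + \tfrac{1}{2}w)\, f(t - \tfrac{1}{2}w)\, dt \qquad (15),$$

provided w is large enough. On the other hand, when w is very small, we may write

$$\int_{t - w/2}^{t + w/2} f(x)\, dx \simeq w f(t),$$

whence we infer that

$$\phi(w) \simeq n(n - 1)\, w^{n-2} \int_{-\infty}^{\infty} f(t + \tfrac{1}{2}w)\, f(t - \tfrac{1}{2}w)\, [f(t)]^{n-2}\, dt \qquad (16),$$

provided w is small enough.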

There does not appear to be any ready means of determining the accuracy of the approxi- 
mations (15) and (16), but it is possible that they may be of assistance in selecting a suitable 
approximation curve, when proceeding to find a range distribution by indirect methods. 

(2) Application in the Case of Samples of 3 from a Normal Population.

Using Equation (11), the ordinates of the frequency curve were obtained with the help
of tables of the normal curve. These ordinates have been compared in Table I with those
previously calculated from a Pearson Type I curve*. It will be seen that the greatest relative
difference occurs at the start of the curves. From the practical point of view, we are concerned

TABLE I.

Comparison of ordinates of (A) true theoretical range curve (n = 3) and (B) fitted Pearson curve.

Range   Ordinates      |  Range   Ordinates      |  Range   Ordinates
          A.      B.   |            A.      B.   |            A.      B.
 0.1      5.5     3.6  |   1.7     42.1    41.2  |   3.3      9.1     9.6
 0.2     10.9     9.2  |   1.8     40.5    39.4  |   3.4      7.8     8.3
 0.3     16.1    15.2  |   1.9     38.6    37.5  |   3.5      6.7     7.1
 0.4     21.1    21.1  |   2.0     36.5    35.3  |   3.6      5.7     6.1
 0.5     25.7    26.5  |   2.1     34.2    33.1  |   3.7      4.8     5.2
 0.6     29.9    31.3  |   2.2     31.8    30.9  |   3.8      4.0     4.3
 0.7     33.7    35.4  |   2.3     29.4    28.6  |   3.9      3.4     3.6
 0.8     36.9    38.7  |   2.4     27.0    26.3  |   4.0      2.8     3.0
 0.9     39.6    41.4  |   2.5     24.6    24.1  |   4.1      2.3     2.5
 1.0     41.8    43.3  |   2.6     22.2    21.9  |   4.2      1.9     2.0
 1.1     43.3    44.5  |   2.7     20.0    19.8  |   4.3      1.5     1.6
 1.2     44.4    45.1  |   2.8     17.8    17.8  |   4.4      1.2     1.3
 1.3     44.9    45.2  |   2.9     15.8    15.9  |   4.5      1.0     1.0
 1.4     44.8    44.8  |   3.0     13.9    14.1  |   4.6      0.8     0.8
 1.5     44.3    43.9  |   3.1     12.2    12.5  |   4.7      0.6     0.6
 1.6     43.4    42.7  |   3.2     10.6    11.0  |   4.8      0.5     0.5


N.B. The unit for range is the population standard deviation, and the curves are calculated
so that the area under each is 100. The tails of the curves extend of course beyond w = 4.8.
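
Column A of Table I and the moments used later may be reproduced with a few lines of code. The sketch takes Equation (11) to read $\phi(w) = (6/\pi\sqrt{2})\, e^{-w^2/4}\int_0^{w/\sqrt{6}} e^{-u^2/2}\,du$ and Equation (14) to read $\mu'_k = (3/\pi)\, 2^{k+1}\, \Gamma(\tfrac{1}{2}k+1)\int_0^{\pi/6}\cos^k\theta\,d\theta$; writing the inner integral of (11) with the error function is a modern convenience and not the authors' procedure, who used tables of the normal curve.

```python
import math

def range_density_n3(w):
    """Equation (11) for samples of 3 from a unit normal population.

    The integral of exp(-u^2/2) from 0 to w/sqrt(6) equals
    sqrt(pi/2) * erf(w / sqrt(12)), which gives the closed form below.
    """
    return 3.0 / math.sqrt(math.pi) * math.exp(-w * w / 4.0) * math.erf(w / math.sqrt(12.0))

def moment(k, steps=200):
    """Equation (14): k-th moment about the origin, by Simpson's rule (steps even)."""
    upper = math.pi / 6.0
    h = upper / steps
    total = 1.0 + math.cos(upper) ** k
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * math.cos(i * h) ** k
    integral = total * h / 3.0
    return 3.0 / math.pi * 2 ** (k + 1) * math.gamma(k / 2 + 1) * integral

mean = moment(1)
sd = math.sqrt(moment(2) - mean ** 2)

if __name__ == "__main__":
    for w in (0.1, 1.3, 2.0, 4.8):
        # Ordinates scaled so that the whole area is 100, as in Table I.
        print(f"{w:3.1f}  {100 * range_density_n3(w):5.1f}")
    print(mean, sd)
```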


* Loc. cit. This curve was made to start at w = 0, and given the correct first three moment
coefficients.






with the extent of error in position of the percentage limits calculated from the approximate
curve. This may be seen in Table II, where the limits are given for:

A. The true curve of Equation (11), obtained by quadrature and backward interpolation.

B. The Pearson curve; the limits are taken from the row, n = 3, of the published table*.

C. A Normal curve having correct mean and standard deviation, namely mean = 1.6926,
σ = 0.8884.


TABLE II.

Percentage limits for range (n = 3) calculated by various methods.

                         Lower Limits                    Upper Limits
Curve used           0.5%     1%      5%     10%     10%      5%      1%     0.5%
A. True curve         .13     .19     .43     .62    2.90    3.31    4.12    4.43
B. Pearson curve      .17     .22     .45     .63    2.92    3.34    4.10    4.36
C. Normal curve      -.60    -.37     .23     .55    2.83    3.15    3.76    3.98


The largest difference between A and B (of .07) occurs at the upper 0.5% limit, the second
largest (of .04) at the lower 0.5% limit; otherwise the differences are .03 or less. Since the
approximate method might be expected to give least satisfactory agreement for this case of n = 3
(where the distribution curve for range has greatest skewness), these results seem to confirm the
opinion previously given when publishing the tables: "the addition of a 3rd decimal place in
the limits would clearly be meaningless, but the retention of the 2nd decimal appears worth
while"†.

Limits calculated by using a normal curve (C) have been shown in Table II to emphasise 
the fact that while the Pearson curve (B) may not lead to mathematically exact limits, it 
provides a far more accurate and useful approximation than can be obtained by the crude 
method (C). 
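
Row A of Table II may be recovered, to the accuracy of the printed figures, by integrating the true curve numerically and solving for each limit. The sketch again assumes the closed form of Equation (11) written with the error function; Simpson's rule and bisection stand in here for the quadrature and backward interpolation of the text, and the function names are illustrative.

```python
import math

def density(w):
    """Range curve for n = 3 from a unit normal population (Equation (11))."""
    return 3.0 / math.sqrt(math.pi) * math.exp(-w * w / 4.0) * math.erf(w / math.sqrt(12.0))

def cumulative(w, steps=400):
    """Area under the curve from 0 to w by Simpson's rule (steps even)."""
    if w <= 0.0:
        return 0.0
    h = w / steps
    total = density(0.0) + density(w)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * density(i * h)
    return total * h / 3.0

def percentage_limit(p, lo=0.0, hi=12.0):
    """Value of the range below which a fraction p of the area lies (bisection)."""
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if cumulative(mid) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

if __name__ == "__main__":
    for p in (0.005, 0.01, 0.05, 0.10, 0.90, 0.95, 0.99, 0.995):
        print(p, round(percentage_limit(p), 2))
```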


(3) Limiting Form.

Finally it seemed of interest to make a trial of the terminal formula (15). When the
population sampled is normal, we have

$$\phi(w) \simeq \frac{n(n - 1)}{2\sqrt{\pi}}\, e^{-w^2/4} \qquad (17).$$

Whence we obtain

$$A_w = \int_w^{\infty} \phi(w)\, dw = n(n - 1) \int_x^{\infty} \frac{e^{-x^2/2}}{\sqrt{2\pi}}\, dx = n(n - 1) \times \tfrac{1}{2}(1 - \alpha_x) \qquad (18),$$

where $x = w/\sqrt{2}$, and $\alpha_x$ is in the notation of Sheppard's Tables.

* Loc. cit. p. 416.

† Loc. cit. p. 405.






For n = 3, we obtain from (18),

    A_w    .05 (5% limit)    .01 (1%)    .005 (0.5%)
    w      3.39              4.15        4.46

the last two of which are in very close agreement with the true limits given in Table II.
For n = 50 and n = 100, the following comparisons are obtained:

    A_w                                     .05 (5%)    .01 (1%)    .005 (0.5%)
    n = 50    Equation (18)                 5.80        6.31        6.52
              Table A of previous paper     5.64        6.23        6.46
    n = 100   Equation (18)                 6.26        6.72        6.92
              Table A of previous paper     6.08        6.63        6.86

In these cases the true position of the limits is not of course known. It seems, however,
likely that for $A_w \le .01$, the equation (18) may lead to an approximation of considerable practical
value when limits are required beyond the range of those tabled, e.g. either for $A_w < .005$, or
n > 100.
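
The limits quoted from Equation (18) are easily recomputed. The sketch assumes that $\tfrac{1}{2}(1 - \alpha_x)$ in Sheppard's notation is the upper tail area of the unit normal curve beyond x, so that (18) reads $A_w = n(n-1)\{1 - \Phi(w/\sqrt{2})\}$; the function name is illustrative, and the inversion uses the normal curve routine of the Python standard library in place of tables.

```python
import math
from statistics import NormalDist

def tail_limit(n, area):
    """Range w whose upper tail area equals `area` under the terminal formula (18).

    Assumes A_w = n (n - 1) * (1 - Phi(w / sqrt(2))), Phi being the unit
    normal probability integral.
    """
    upper_tail = area / (n * (n - 1))
    # By symmetry the point with upper tail q is minus the point with lower tail q.
    x = -NormalDist().inv_cdf(upper_tail)
    return math.sqrt(2.0) * x

if __name__ == "__main__":
    for n in (3, 50, 100):
        print(n, [round(tail_limit(n, a), 2) for a in (0.05, 0.01, 0.005)])
```

The formula is a terminal approximation only, so that the agreement to be expected improves as the tail area diminishes.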


(iii) On a Recurrence Relation connected with the Double
Bessel Functions $K_{\tau_1, \tau_2}(x)$ and $T_{\tau_1, \tau_2}(x)$.

By CONSTANCE M. RIGBY, Ph.D. 

A recurrence relation is required for 

where T '*U <*> r ( ,V*) {x) < xxxvii >’ 


and g,„„ («) - y * f/-« (<-l) Tl {t+\) r *dt (Mix), 

where ^^i(ri + r 2 + 1), 

the numbers (xxxvii) and (xxix) referring to the definition of $T_{\tau_1, \tau_2}(x)$ and $K_{\tau_1, \tau_2}(x)$ in Professor
Karl Pearson's paper in Biometrika, Vol. xxv. pp. 168-178.

We have by (xliii) of the same paper 

dx - 1 0 “ (i^l) ) ^i-i.rt-1 W ~ 

Al80 e ~' X i T r»r, <*> - i («-' i n i ,„W)+^-' i n il , 1 (4 

Write e ' Pl7, r l .r,W=^ ll r,(®). 

«)- s£i K^)*) W* C-) - 1+£^) ^ W- 

Integrating, 

^ •» ^ - 1 (’ - (> _ i)*) f a * *** 







Multiplying equation (xliii bis) by we have 

{fcf + f rr-UU-l <*> " 2^1 <*> 

' fc- 1 + ((sTTH^ ~ 2^l) ■"■} ^l-l. »t-l (;r) ~ 2^1 Tx { n-1 


Integrating 


r 2 V 2 j / f| — Tjj JJ \ /"* ~ 

Tl ’ T,_ 2v^ i + \(2v _ 1)2 ” 2^1 J J„ X ^ T.-l.r.-l (*)<** 

"IT^l f 0 X dx ^ T i— l. T a— 1 W “k 

_2*' — 2 r / Ti — r 2 P \ f" m 

“Sr-l 7 n-i. »«-i + V(2r - l) s ~ 27—1 J j 0 x ' ^i-i, n-l (*) ** 

“ 27^1 r ^ T i-i. '»-i + 17^1 ^ T i-i. T »-i 

“ ^ 1 - 1 , n-i + ((27-1]* ~ IT^l) W ** 


i^-hthW 


.( 2 ). 


Using (1) to eliminate I xf r , (.t;) d# wo have 
yo 1 i’ 8 1 

1 / r i — r 2 \ 

y— l \2i/ — i *V 


^ T l* T « r 8 -l + 


2v-lV \2 p — 1/ / 


{^.r,W-^.„(0)+(. + ^)/ Tl ,4 

“ 27-1*^., '.'*>> 


* 2^-1 


p2 _to 2 Y 

/ ^ / I zy 1 (r)-f fOV^-l Li / 

r >- T * r «~ l + , _/t, - r,y ' r * T * ( ' ' Tl,T * + /nr n\* /- 

\2y- 1/ \2v —1 / 


2 r lf r t 


2v 




Writing 


r, - r «2 r, - r 2 


ri+r 2 2i/-l * 

fTi _ a A 1# r a ** ^,-1,72-1 + x r? ^ T i* T a ^ ” ^1,^2 ~ 2v ~ 1 ^ T i. T » 


1-k 2 

-p 2 *I-X» T *-A 1— p 

which is the required recurrence formula. 


fB ^„r, (0)} - r~! n i( „W 



CAMBRIDGE: PRINTED BY
W. LEWIS, M.A.

AT THE UNIVERSITY PRESS








Biometrika, Vol. XXV, Parts I and II





Diagram II. Type Contour of Southern Albanian Group. 
Biometrika, Vol. XXV, Parts I and II 














iyfH»ww>di %• Cowfcowr, WW J®t^ Kwma SlutU 


Biometrika, Vol. XXV, Parts III and IV



Horizontal Tv#* Certteur, baaed an 90 $ Kermct Skdk 


Biometrika, Vol. XXV, Parts III and IV












